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NOVEL UBPSrp POLYPEPTIDES 
AND THEIR USE IN THE TREATMENT OF PSORIASIS 



FIELD OF THE INVENTION 

5 This invention relates to a novel gene encoding a protein of the ubiquitin-proteasome 

pathway, UBP8rp. The invention also relates the use of UBP8rp polypeptides for screening for 
modulators, and to the use of said modulators for treating chronic inflammatory diseases such 
as, e.g., psoriasis, psoriatic arthritis, rheumatoid arthritis, asthma, inflammatory bowel disease 
and multiple sclerosis. The invention further relates to the use of biallelic markers located in the 
1 0 UBP8rp gene for diagnosing said chronic inflammatory diseases. 

BACKGROUND 

1. Psoriasis 

15 Psoriasis is a chronic, recurring disease recognizable by silvery scaling bumps and various - 
sized plaques (raised patches). An abnormally high rate of growth and turnover of skin cells 
causes the scaling. The reason for the rapid cell growth is unknown, but immune mechanisms 
are thought to play a role. The condition often runs in families. Psoriasis is common, affecti ng 2 
to 4 % of whites, blacks are less likely to get the disease. Psoriasis begins most often in people 

20 aged 10 to 40, although people in ail age groups are susceptible. 

1.1. Symptoms 

Psoriasis usually starts as one or more small psoriatic plaques that become excessively 
flaky. Small bumps may develop around the area. Although the first plaques may clear up by 

25 themselves, others may soon follow. Some plaques may remain thumbnail -sized, but others 
may grow to cover large areas of the body, sometimes in stri king ring-shaped or spiral patterns. 

Psoriasis typically involves the scalp, elbows, knees, back, and buttocks. The flaking may be 
mistaken for severe dandruff, but the patchy nature of psoriasis, with flaking areas interspersed 
among completely normal ones, distinguishes the disease from dandruff. Psoriasis can also 

30 break out around and under the nails, making them thick and deformed. The eyebrows, armpits, 
navel, and groin may also be affected. Usually, psoriasis produces only flaking. Even itching is 
uncommon. When flaking areas heal, the skin takes on a completely normal appearance, and 
hair growth is unchanged. Most people with limited psoriasis suffer few problems beyond the 
flaking, although the skin's appearance may be embarrassing. 

35 Some people, however, have extensive psoriasis or experience serious effects from 
psoriasis. Psoriatic arthritis produces symptoms very similar to those of rheumatoid arthritis. 
Rarely, psoriasis covers the entire body and produces exfoliative psoriatic dermatitis, in which 



the entire skin becomes inflamed. This fern, of psoriasis is serious because, like a burn, it keeps 
*e skin from serving as a protect bamer a 9 ainst Injury and infacflon. in anrriher uncommon 
tarn of psoriasis, pustular psoriasis, large and sma. pus^illed pimples (pustules) form on the 
palms of the hands and sotee of the feet Sometimes, these pustules era scattered on the body. 

Psoriasismayflare up for no apparent reason, or a flares may resuttfrom severe sunburn 
skin irritation, anttmaiaria dregs, lithium, beta-blocker drugs (such as prepranoloi and 
metoproloi), or aimost any medicated ointment or cream. Streptococcal infections (espeoally ,n 
children), bruises, and scretches can also stimulate the formation of new plaques. 



10 1.2. Mnlocular basis 
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1.2. Mglecuiaroasis . 
Psoriasis Is a chronic inflammatory disease. The inflammatory events ,n psonas,s are 
closed of a complex series of krducttve and ettector processes, which require tttereguiator^ 
expression of various proinflammatory genes. NF-rf. Is a protein ttanscnpbon tecto *a. « 
quired for maximal transcription of many of these prolnAammato* moiecules. tt oonsisto of a 
h lrod.mer of me p50 and p65 protoins retained Inaoflve to the cytoplasm •*%»""> * e 
tohttflcy subuntt ,«B. Upon acttvatton, ,«B is rapidly and sequel P osphcrylafed bytte 
actton Z l*B kinases, ubiquiflnatod. and degraded by me ^^^™**Z 
subuntt (PSO and p66) is ttanslocated to fhe nucleus, where if birds to cognate DM A sequences 
and stimulates gene transcription of proinflammatory genes. 

1.3. Diagnosis . . 

Psoriasis may be misdiagnosed at flrat because many other disorders can produce s,mUar 
piaques and flaking. To confirm a diagnosis, a doctor may perforra a skin biopsy b, removmg a 
skin specimen and examining it under a microscope. 

JvtTa^n has only a few sma. plaques, using ointments and creams that lubricate me 
sMn (emClents) once or twice a day c*n keep me skin moist Ointments 
corifcostorokts. Vttamin D cream, sa»cy,,c add or coa, tor are eflecttve ,n many patten^ 
MM psoriasis. Stronger medications like antra* are used soraetknes, but tttey can ,m«e 
the skin and stain sheets and cfothing. Whan fhe scalp is affected, shampoos contenting these 
active ingredients are often used. For pustular psoriasis, the two most effective medications are 
etttZ and isotreflnoi, whld, are also used to trea, severe acne. Uttra.Ce, W -» - 
he* dear up psoriasis. In fad, during summer monttrs, exposed regions of enacted skrn may 
, dear up spontoneously. Sunbaihing often helps to clear up the plaques on larger areas of the 
body exposure to ultraviolet Bght under controlled conditions is another common therapy. 
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No drug for treating severe forms of psoriasis without severe side effects is marketed yet 
For extensive psoriasis, ultraviolet therapy may be supplemented by psoralens, drugs that make 
the skin extra sensitive to the effects of ultraviolet light The combinati on of psoralens and 
ultraviolet light (PUVA) is usually effective and may clear up the skin for several months. 
However, PUVA treatment can increase the risk of skin cancer from ultraviolet light; therefore, 
the treatment must be closely supervised by a doctor. For most serious forms of psoriasis and 
widespread psoriasis, a doctor may give methotrexate. Used to treat some forms of cancer, this 
drug interferes with the growth and multiplication of skin cells. It can be effective in extreme 
cases but may cause adverse effects on the bone marrow, kidneys, and liver. Another effective 
medication, cyclosporine, also has serious side effects. 

New generation drugs that are currently under development include Efalizumab (Raptiva®), 
an humanized anti-CDHa antibody, it has been shown that Efalizumab, given subcutaneously 
once-weekly, provides clinical benefit in patients with moderate -to-severe plaque psoriasis 
(Cather et al. (2003) Expert Opin Biol Ther. 3:361 -370). Efalizumab offers an new therapeutic 
option for the treatment of psoriasis and the potential for improved and potentially safer long - 
term, continuous "maintenance" therapy. 

2. Psoriasis susceptibility loci 

The multifactorial etiology of psoriasis is well established. Although environmental factors, 
such as streptococcal infections, affect the onset of the disease, family studies indicate a strong 
genetic component. Twin studies show the concordance in monozygotic twins to be 65 to 70% 
(Farber et al., 1974), compared to 15 to 20% in dizygotic twins. Family studies estimate the risk 
to first-degree relatives at between 8 to 23%. However, there are also several known 
environmental factors, including streptococcal infection and stress, that affect the onset and 
presentation of the disease. 

Several psoriasis susceptibility loci have been mapped: PSORS1 on 6p21, PSORS2 on 
17q, PSORS3 on 4q, PSORS4 on 1cen-q21, PSORS5 on 3q21, PSORS6 on 19p, PSORS7 on 
1p, and PSORS8 on 4q31. The loci on 6p and 17q appear to be well established. Additional 
putative psoriasis candidate loci have been reported on 16q and 20p. 

The major susceptibility locus for psoriasis is PSORS1 (Nair et al. 1997; Trembath et al. 
1997; Oka et al. 1999; Lee et al. 2000; Veal et al. 2001). Several positional candidate genes are 
located within the PSORS1 susceptibility locus for psoriasis: HLA-C (the leukocyte antigen C), 
HCR (the a-helix-coiled-coil-rod homologue), POU5F1 (the octamer transcription factor 3), 
TCF19 (the cell growth -regulated gene), the corneodesmosin gene, a gene encoding a plecOn- 
like protein and three genes displaying no homology to any known sequences in any DNA 
database. 



Veal et al. performed a SNP-haplotype-based association analysis of PSORS1 to refine the 
susceptibility locus (Veal et al. 2002). They identified a 10-kb major region for suscepWrty for 
psoriasis. They showed that this restricted region comprised two biailelic markers, SNPs n.7 
and n.9. with probability values clearly exceeding any other markers studied before. Th.s 10-kb 
region did not contain any known gene. In addition, database analysis of this restricted reg.on 
did not allow the identification of any expressed gene, although a non -expressed P"**™ 
was identified. Since SNPs n.7 and n.9 lie in a non coding region, respectively 7 and 4 kb 
centromeric to HLA-C, Veal et al. concluded that SNPs n.7 and n.9 may He within a regulatory 
region influencing expression of HLA-C. 

3 The ubiqultin-proteasome pathway 

The ubiquitin proteasome pathway has a central role in the selective degradation of 
intracellular proteins. Among the key proteins moduiated by the proteasome are those Involved 
in the control of inflammatory presses, cell cycle regulation, cell growth and gene express,on. 
The proteasome is a large multimeric protease present in all eukaryotic cells that exhibns a 
highiy conserved 20S core structure. Proteasomes are responsible for the degradation of 
protein substrates after they have been "tagged" by a poiy-ubiquitin chain. Among others, the 
proteasome is known to be responsible for the degradation of IkB (Regnier et ai. 1997. Cel.. 
90-373-383) Thus proteasome inhibition inhibits NFkB activation by blocking the degradation of 
its inhibitory protein IkB. and inhibition of the proteasome has been proposed as a potential 
mean to treat T cell-mediated disorders such as psoriasis (Zol.ner et al. 2002 J Clin Invest 

109:671-9). , 

The selective degradation of proteins through the ubiquitin proteasome pathway .nvolves the 
activation of a signaling cascade that generates the covalent attachment of a polyubiquitin cha.n 
to protein targets. The polyubiquitin chain formed through the addition of muitipie ub.qu.tin 
molecules to the target acts as a signal for degradation by the proteasome, a iarge mulfmenc 
protein complex. Ubiquitin conjugation requires the presence of three key enzymes: »e 
ubiquitin-activating enzyme E1. the ubiquitin -conjugating enzyme E2 and the ubiquitin hgase E3. 

De-ubiquitinating activities can promote the accumulation of ubiquitin in a given cel. and are 
also thought to counteract the effects of E2/E3-mediated conjugation by removing the 
polyubiquitin chain from conjugated proteins prior to their degradation by the proteasome. Th.s 
might either represent a means of preventing degradation by the proteasome, or might be part of 
those ubiquitination processes not aimed at directing protein degradation. De-ubiquitinat.ng 
enzymes can be subdivided into two broad groups: ubiquitin C-terminal hydrolases (UCHs) and 
> ubiquitin isopeptidases (UBPs) (Wilkinson. 1997). As far as UBPs are concerned, a number of 
published reports indicate that certain UBPs have highly specific functions. IsoT. a member of 
this family that has been studied in detail, is able to cleave both linear and isopeptide -l.nked 
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ubiquitin, and it appears to require a free ubiquitin C -terminus for optimal activity (Wilkinson 
etal. 1995). 

Naviglio et al cloned and characterized the UBP8 ubiquitin isopeptidase in 1998 (EMBO J. 
17:3241 -3250). The biochemical activity of UBP8 was determined, and it was shown that UBP8 

5 can both hydrolyze ubiquitin -isopeptide bonds and cleave purified linear ubiquitin chains. Down- 
regulation of UBP8 generates a substantial derangement of the overall cell protein 
ubiquitination, thus indicating that UBP8 plays a general role in the ubiquitin pathway. Moreover, 
microinjection of antisense UBP8 cDNA in quiescent human cells prevents S-phase entry, and 
microinjection of antisense UBP8 cDNA in growing osteosarcoma cells determines instead an 

10 accumulation of cells in S-phase. Thus Naviglio et al. showed that inhibition of the cellular 
ubiquitin isopeptidase UBP8 has a striking effect on cell proliferation. In 2000, UBP8 was shown 
to associate with Hrs-binding protein both in vitro and in cultured cells (Kato et al. J Biol Chem. 
275:37481-37487). Hrs-binding protein together with Hrs plays a regulatory role in endocytic 
trafficking of growth factor-receptor complexes through early endosomes. Kato et al. 

15 hypothesized that UBP8 associated with Hbp plays a positive regulatory role in proteasomai 
and/or lysosomal degradation of growth factor receptors. 

Accordingly, proteins of the ubiquitin -proteasome pathway have been shown to play an 
important role in, e.g., cell cycle regulation, regulation of cell proliferation and degradation of 
proteins involved in inflammation. Consequently, modulation of proteins of the ubiquitin - 

20 proteasome pathway is a treatment option for cancer and chronic inflammatory diseases such 
as, e.g., rheumatoid arthritis, asthma, inflammatory bowel disease, multiple sclerosis and 
psoriasis. 

SUMMARY OF THE INVENTION 

25 The present invention stems from the finding of an expressed gene located at human 

chromosome 6p21, within the 10-kb region that defines the major susceptibility locus for 
psoriasis. This gene, the UBP8rp gene, encodes a protein of the ubiquitin protea some pathway. 
The UBP8rp gene comprises two introns located at nucleotide positions 1018 to 1046 of SEQ 
ID NO: 1 and 1676 to 1718 of SEQ ID NO: 1. 

30 Therefore, an aspect of the present invention relates to an isolated UBP8rp gene 

comprising introns having a sequence of (i) nucleotides 1018 to 1046 of SEQ ID NO: 1; and (ii) 
nucleotides 1676 to 1718 of SEQ ID NO: 1. 



The present invention further relates to an isolated UBP8rp polynucleotide complementary 
to a messenger RNA transcribed from the UBP8rp gene. 
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The present invention further pertains to a purified UBPSrp polypeptide encoded by the 
UBP8rp gene or by a UBP8rp polynucleotide. 

The present invention is further directed to an expression vector comprising the UBPSrp 
gene or a UBP8rp polynucleotide. 

A host cell comprising the above expression vector is a further aspect of the present 
invention. 

The present invention is further directed to a method of making a UBPSrp polypeptide, said 
method comprising the steps of culturing a host cel. according to the inventio n under conditions 
suitable for the production of a UBPSrp polypeptide within said host cell. 

A further aspect of the invention relates to an antibody that specifically binds to a UBPSrp 
polypeptide. 

The use of a UBPSrp polypeptide as a target for screenin g for natural binding partners, the 
use of a UBPSrp poiypeptide as a target for screening candidate modulators, and the use of a 
modulator of a UBPSrp polypeptide for preparing a medicament for the treatment of a chron.c 
15 inflammatory disease are also within the present invention. 

Further, the present invention pertains to a method of assessing the efficiency of a 
modulator of a UBPSrp polypeptide for the treatment of psoriasis, said method I compn,ng 
administering said moduiator to an anima. mode, for psori asls; wherein a donatio that ^d 
modulator ameliorates a representative characteristic of psoriasis in said animal mode, .nd.cates 
20 that said modulator is a drug for the treatment of psoriasis. 

The present invention is further based on the finding of novel UBP8rp-re.ated biallelic 
markers located wrthin the major susceptibility .ecus for psoriasis. These UBPSrp-related 
biallelic markers are depicted in the table below: 

Biallelic maker No. 1 Position on SEQ iD NO: 1 Alternative nucleotides 



1 



1199 



2 1262 
4 1444 

6 1490 

7 1505 

1630 
1680 
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A/G 
C/T 
G/T 
A/G 
G/T 
A/G 
A/G 



19 IDOU - - 

1? A/G 

A/G 



13 1895 

14 2180 



15 2449 

16 2721 

17 3127 

18 3137 

19 3138 

21 3222 

22 3269 



A/T 

G/T 

A/G 

CfT 

A/G 

C/G 

CfT 



7 



23 


3445 


C/T 


24 


3470 


A/G 


25 


3915 


C/T 


26 


3973 


A/C 


27 


4254 


A/G 


28 


4472 


A/T 


29 


4660 


C/T 


31 


4919 


A/G 


32 


4973 


C/T 


33 


5063 


C/T 


34 


5065 


G/T 


35 


5079 


C/T 


37 


5088 


C/G 


38 


5090 


c/r 


39 


5407 


C/T 


40 


5466 


A/G 


41 


5520 


C/T 



Therefore, in a further aspect, the present invention is directed to the use of at least one 
UBP8rp-related biallelic marker for determining whether there is a significant association 
between said biallelic marker and a chronic inflammatory disease. 

The present invention further relates to the use of at least one UBP8rp-related biallelic 
5 marker for diagnosing whether an individual suffers from or is at risk of suffering from a chronic 
inflammatory disease. 

The invention also concerns a method of genotyping comprising the steps of: (a) isolating a 
nucleic acid from a biological sample; and (b) detecting the nucleotide present at one or more of 
the UBP8rp-related biallelic markers. 

10 

BRIEF DESCRIPTION OF THE FIGURES 

Figures 1Ato 1K show the annotation of the gene encoding UBP8rp. 

Figure 2 shows an alignment between UBP8rp (SEQ ID NO: 3) and UBP8 (SEQ ID NO: 4). 

Figure 3 shows the rhodanese domain of UBP8rp (SEQ ID NO: 3) 

15 

BRIEF PESCRIFTtOftl OF THE SEQUENCES OF THE SEQUENCE LISTING 
SEQ ID NO: 1 corresponds to the genomic region comprising the UBP8rp gene. 
SEQ ID NO: 2 corresponds to the CDS coding for UBP8rp. 
SEQ ID NO: 3 corresponds to the protein sequence of UBP8rp. 
20 SEQ ID NO: 4 corresponds to the protein sequence of UBP8. 
SEQ ID Nos. 5-51 correspond to primers. 



e 
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nrrmcn nPfiCRIPTlO " " c •""= IMVENTION 

The present Invention stems tan the finding of an expressed gene located at human 
chromosome 6p21. witoin toe 10-Kb region the, deflnes the ma,or suscep«, ^stor 
psortasls. This gene nodes for a neve, protein of the ubiquilto -preteasome patoway. UBPSrp. 
NovelblallellcmariterelooatedinmeUBPSrogenearefurmerprovlded. 

Aooordlngly. the present invention provides novel UBPSrp polypepbdes and means to 
toenWytZounds usetol In toe beatoien, of psortas. and other Cronio drseas- 
e^l eg pscrtebc arthritis, rheumatoid arthritis, aslhma, .nnammatory bowel d,se»e and 
rnu^r tresis Spe*a»y. me Invenbon relates to the use of UBPSrp polyposes ae 

* modltore toereof. The use of sad module for 
^ler drenic Inflammatory diseases, and the use of novel b.a«e,io markers tocatod m the 
l£Z gene tor diagnosing psodasis and otoer chronto innammatory diseases are tortoer 
aspects of the present invention. 

1 pn| Y nn^^'^ pre**"* invention 

"a, ret aspeo. of the present Invemion rebates to an fcoiated gene con^nsmg ™™ 
having a sequence of d) nucleotides 1018 to 1046 of SEQ ID NO: 1: and (n) nucleotides 1676 to 

proteinlding sequences of a gene. Inbons are ascribed toto prtmary RNA but are cut out of 
t heprimanrRf«togeneratean»ssengerRNAthatKlstrensletedintoprote.n. 

Tused heL, toe torn, W refers to a sequence of nucieobdes tocatod to . 
■articular posibon on a pedicular diromosome toa, encodes a sp«c«ic pretem. A gene usually 
, ns, imrens, S and , untoanstoted regtons. and upstoeam and — m 

LLry sequences. A gene may encode different Isoforms of toe seme preten. These 
3,1 may be generated by. e.g.. aHernadve spncfng events or star, o, bandahon bore 
rire ilbon codons. The tern, -gene- as used hereto, does no, include 

As tortoer used In tols specHication. toe term -ysffiifcn^" refers to toe gene 
- comprtstg to^lons shown a, nucleobdes 1018 to 1046 and nucteobdes ,676to 1718 of SEO 
T^TZ gene is located attocus 6p2 1. wltoin toe major susoepbbilby locus tor peonage. 

andOT rJ:s U « -— St. « r r 

meanings. However, each term may be subsided tor anotoer hereto to change toe scope of 

36 *" ' nV te n elle,ic vartante o, toe UBP&p gene are encom passed Win toe scope toe present 
tevenbon. Severe, alleles of toe UBP8rp gene are shown on SEQ ID MO: 1. Furthered. 
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procedures known in the art can be used to obtain other allelic variants of the UBP8rp gene 
using information from the sequences disclosed herein. For example, other allelic variants may 
be isolated and identified by making suitable probes or primers from the sequences provided 
herein and screening a suitable nucleic acid source for allelic variants using any technique 
5 known to those skilled in the art. 

Another aspect of the present invention relates to an isolated polynucleotide 
complementary to a messenger RNA transcribed from the gene of claim 1 . 

As further used herein, the term " UBP8rp polynucleotide" refers to an isolated 
10 polynucleotide complementary to a messenger RNA transcribed from the UBP8rp gene, or to a 
fragment thereof. 

As used herein, the term " messenger RNA' 1 (mRNA) refers to the processed RNA 
molecule that does not comprise any intron sequence. The term messenger RNA encompasses 
all alternative splice variants translated from the UBP8rp gene. 

15 Such a messenger RNA may comprise any combination of exon of the UBP8rp gene. In 

one embodiment, the UBP8rp polynucleotide comprises exon 1 comprising nucleotides 1 to 167 
of SEQ ID NO: 2. In another embodiment, the UBP8rp polynucleotide comprises exon 2 
comprising nucleotides 168 to 796 of SEQ ID NO: 2. In another embodiment, the UBP8rp 
polynucleotide comprises exon 3 comprising nucleotides 797 to 1449 of SEQ ID NO: 2. 

20 In a preferred embodiment, the UBP8rp polynucleotide comprises SEQ ID NO: 2. 

Any procedures known in the art can be used to obtain UBP8rp polynucleotides. UBP8rp 
polynucleotides can for example be obtained as described in Example 1. 

The present invention also encompasses UBP8rp polynucleotides for use as primers 
and probes. Such primers are useful in order to detect the presence of at least a copy of a 

25 UBP8rp polynucleotide, complement, or variant thereof in a test sample. The probes of the 
present invention are useful for a number of purposes. They can preferably be used in 
Southern hybridization to genomic DNA. The probes can also be used to detect PGR 
amplification products. They may also be used to detect mismatches in the UBP8rp using other 
techniques. They may further be used for in situ hybridization. Preferred primers of the present 

30 invention are those of SEQ ID Nos. 5-51. 

The present invention also encompasses polynucleotides UBP8rp polynucleotide that 
codes for a fragment of a UBP8rp polypeptide. The fragment may for example consist of an 
antigenic epitope of the UBP8rp and find use in production of antibodies. 

Any of the polynucleotides, primers and probes of the present invention can be 

35 conveniently immobilized on a solid substrate, such as, e.g., a micro array. A substrate 
comprising a plurality of oligonucleotide primers or probes of the invention may be used either 
for detecting or amplifying targeted sequences in the UBP8rp gene, may be used for detecting 
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mutations ,n me coding or ,n t. non-ding sequences ofthe UBPSrp mRNAs srd ^ 
be used to detains expression of UBPSrp mRNAs .n different contexts suoh as In different 
tissues, at different stages of a process (embryo deveiopment. disease treatment), and ,n 
patients versus healthy individuals. 

5 

o Polype ptides off f h« present invention 

Another aspect of the present invention reiates to a purified poiypeptide encoded by the 
UBP8rp gene or by a UBP8rp polynucleotide. 

Preferably, the UBPSrp polypeptide is selected from the group consisting of: 

10 a) a polypeptide comprising SEQ ID NO:3; 

b) aP olypeptidecomprisingaspanofatleast470aminoacidsofSEQIDNO:3; 

c) a po^de comprising a span of at least 15 amino acids of SEQ ID NO: 3, where, 
said span falls within amino acids 467 to 482 of SEQ ID NO: 3; 

d) a mutein of any of (a) to (c), wherein the amino acid sequence has at least 95 h, 96 /o. 
15 97% 98% or 99% identity to at least one of the sequences in (a) to (c); 

e) amuteinofanyof(a)to(c)wh^ 

complement of a DNA sequence encoding any of (a) to (c) under h,gh.y stnngent 

conditions; and 

f) a mutein of any of (a) to (c) wherein any changes in the amino acd sequence are 
20 conservative amino acid substitutions to the amino acid sequences .n (a) to (c). 

The term "j «*m nn.voeotide" is used herein to embrace all of the polypeptides of the 

present >nven^ ^ spends to a full-length UBPSrp 

protein The UBPSrp protein is a member of the ubiquitin proteasome pathway, as desenbed ,n 
25 Example 1 UBPSrp plays a role in the ubiquitin -conjugation and de-ubiquitination of intracellular 
p^n^ 

de-ubiquitinating enzymes. The biologies, activity of a UBPSrp poiypeptide refers to the 
modulation of the ubiquitination state of intracellular proteins by UBP8rp. 

The present invention is also directed to polypeptides consisting of a fragment of at least 
30 10 15 20 30 40. 50, 60, 70. 80. 90, 100. 110, 120, 130. 140. 150, 160. 170. 180. 190. 200. 
0 220 230 240. 250. 260. 270. 280. 290. 300. 310. 320. 330. 340. 350. 360, 370. 380. 390 
40o'. 410. 420. 430. 440. 450. 460. 470 or 480 amino acids of SEQ .D NO: 3. Preferably, sa ,d 
fragmentfallswithinaminoacids467to482ofSEQIDNO:3. 

The present invention is also directed to naturally occuring. recombmant. or ch.menc 
35 polypeptides comprising any of the above fragments. 

One embodiment is directed to a UBPSrp polypeptide wherein the amino acd at pos.t,on 
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107 of SEQ ID NO: 3 is an arginine. Another embodiment is directed to a UBP8rp polypeptide 
wherein the amino acid at position 107 of SEQ ID NO: 3 is a lysine. 

One embodiment is directed to a UBP8rp polypeptide wherein the amino acid at positioi 
128 of SEQ ID NO: 3 is an threonine. Another embodiment is directed to a UBP8rp polypeptide 
wherein the amino acid at position 128 of SEQ ID NO: 3 is a methionine. 

One embodiment is directed to a UBP8rp polypeptide wherein the amino acid at positioi 
183 of SEQ ID NO: 3 is an asparagine. Another embodiment is directed to a UBP8rf 
polypeptide wherein the amino acid at position 183 of SEQ ID NO: 3 is an histidine. 

One embodiment is directed to a UBP8rp polypeptide wherein the amino acid at positioi 
189 of SEQ ID NO: 3 is an asparagine. Another embodiment is directed to a UBP8rf 
polypeptide wherein the amino acid at position 189 of SEQ ID NO: 3 is a tyrosine. 

One embodiment is directed to a UBP8rp polypeptide wherein the amino acid at positioi 

203 of SEQ ID NO: 3 is glycine. Another embodiment is directed to a UBP8rp polypeptide 
wherein the amino acid at position 203 of SEQ ID NO: 3 is a glutamic acid. 

One embodiment is directed to a UBP8rp polypeptide wherein the amino acid at positior 

204 of SEQ ID NO: 3 is an arginine. Another embodiment is directed to a UBP8rp polypeptide 
wherein the amino acid at position 204 of SEQ ID NO: 3 is a lysine. 

One embodiment is directed to a UBP8rp polypeptide wherein the amino acid at positior 
209 of SEQ ID NO: 3 is a glycine. Another embodiment is directed to a UBP8rp polypeptide 
wherein the amino acid at position 209 of SEQ ID NO: 3 is a valine. 

One embodiment is directed to a UBP8rp polypeptide wherein the amino acid at positior 
251 of SEQ ID NO: 3 is an glycine. Another embodiment is directed to a UBP8rp polypeptide 
wherein the amino acid at position 251 of SEQ ID NO: 3 is an arginine. 

One embodiment is directed to a UBP8rp polypeptide wherein the amino acid at positior 
325 of SEQ ID NO: 3 is an glutamic acid. Another embodiment is directed to a UBP8rf 
polypeptide wherein the amino acid at position 325 of SEQ ID NO: 3 is a lysine. 

One embodiment is directed to a UBP8rp polypeptide wherein Ihe amino acid at positior 
420 of SEQ ID NO: 3 is an alanine. Another embodiment is directed to a UBP8rp polypeptide 
wherein the amino acid at position 420 of SEQ ID NO: 3 is a threonine. 

Further embodiments are directed to muteins. As used herein the term "muteins" refers 
to analogs of UBP8rp, in which one or more of the amino acid residues of a natural UBP8rp arc 
replaced by different amino acid residues, or are deleted, or one or more amino acid residues 
are added to the natural sequence of UBP8rp, without lowering considerably the activity of the 
resulting products as compared with the wild-type UBPSrp. These muteins are prepared b) 
known synthesis and/or by site-directed mutagenesis techniques, or any other known technique 
suitable therefore. 



10 



m , teI ns of UBPSrp which can be used in accordance with the present invention, or 
Mut e,ns of UBPft* wh sequences as 

nucleic acid coding thereof, include a tmixe ^ 

conditions. The tenn -setngent conditions- refen, to ima-*-. Soo 

Condons. toose of orttoary M to toe art oonve " Y §6 3 and 6.4 
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(1987. 1992), and Sambrook et el (Sambrook, J. C, Fritech. ^ ^ « Spring 

Molecular Ctonlng: A LaboraW Manuat. Coto Spang Harbor Laboratory Pre 

Harttor.NY). . _.<m™= inrinrie washing conditions 12 -20X 

Vtftoou. MMon. examp.es of stongen. cond*ons .w* 9 5 

Wow toe calculated Tm o, toe hybrid under study tn. e. 9 .. * ^ s £* 37 - C tor 30-60 
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,„ . art understand « -~ ~ — —-e pn^es. I. 

sequence a, «- « ^^^^T*. UBPSRPb pofype^ 
. preferably 70% «* ^^tvlng an an.no actd stance a. leas, for 

example^ Klenboal toaq ry ^peptide l8 Wenllcal to the query sequence 

that toe amino add sequence of toe subieot P«W* atteratlons per 

except toat toe subiect polypeptide sequence may mdude up to Ave am, 

*-i tho nii^rv amino acid sequence. In other woras, to uuia 
30 each 100 amino acids of the query amino jm ^ 

, de — Zener,. toe , sequences to be compared^ - £ • — 

o^on between toe sequences. ™™TT % ^W»<^ ~ - 
sequences, to enhance the degree of alignment. A /« .denaty may 
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whole length of each of the sequences being compared (so-called global alignment), that I 
particularly suitable for sequences of the same or very similar length, or over shorter, definet 
lengths (so-called local alignment), that is more suitable for sequences of unequal length. 
Methods for comparing the identity and homology of two or more sequences are well known in 
the art. Thus for instance, programs available in the Wisconsin Sequence Analysis Package, 
version 9.1 (Devereux J et al. (1984) Nucleic Acids Res. 12:387-395), for example the programs 
BESTFIT and GAP, may be used to determine the % identity between two poly nucleotides and 
the % identity and the % homology between two polypeptide sequences. BESTFIT uses the 
"local homology" algorithm of Smith and Waterman (1981, J Mol Evol. 18:38-46) and finds the 
best single region of similarity between two sequences. Other programs for determining identity 
and/or similarity between sequences are also known in the art, for instance the BLAST family of 
programs (Altschul et al. (1990) J Mol Biol. 215:403-410), accessible through the home page of 
the NCBl at world wide web site ncbi.nim.nih.gov) and FASTA (Pearson (1990) Methods in 
Enzymology, 183:63-99; Pearson and Lipman (1988) Proc Nat Acad Sci USA, 85:2444-2448). 

Preferred changes for muteins in accordance with the present invention are what are 
known as "conservative" substitutions. Conservative amino acid substitutions of UBP8rf 
polypeptides, may include synonymous amino acids within a group which have sufficiently 
similar physicochemical properties that substitution between members of the group will preserve 
the biological function of the molecule (Grantham (1974) Science 185:862-864). It is dearths 
insertions and deletions of amino acids may also be made in the above -defined sequence! 
without altering their function, particularly if the insertions or deletions only invo Ive a few amin< 
acids, e.g. under thirty, and preferably under ten, and do not remove or displace amino acids 
which are critical to a functional conformation, e.g. cysteine residues. Proteins and muteins 
produced by such deletions and/or insertions come within the purview of the present invention. 

Preferably, the synonymous amino acid groups are those defined in Table I. More 
preferably, the synonymous amino acid groups are those defined in Table II; and mosl 
preferably the synonymous amino acid groups are those defined in Table III. 



TABLE I 

Preferred Groups of Synonymous Amino Acids 



Amino Acid Synonymous Group 

Ser Ser, Thr, Gly, Asn 

Arg Arg, Gin, Lys, Glu, His 

Leu "e, Phe, Tyr, Met, Val, Leu 

Pro Gly, Ala. Thr, Pro 

Thr Pro, Ser, Ala, Gly, His, G In, Thr 

Ala Gly, Thr, Pro, Ala 

Val Met, Tyr, Phe, He, Leu, Val 

Gly Ala, Thr, Pro, Ser, Gly 

He Met, Tyr, Phe, Val, Leu, lie 



Phe 

Tyr 

Cys 

His 

Gin 

Asn 

Lys 

Asp 

Glu 

Met 

Trp 
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Trp, Met, Tyr, He, Val, Leu, Pne 
Trp, Met, Phe, He, Val, Leu, Tyr 
Ser.Thr.Cys 

Glu, Lys, Gin, Thr, Arg, His 
Glu, Lys, Asn, His, Thr, Arg, Gin 
Gin, Asp, Ser, Asn 
Glu, Gin. His, Arg, Lys 
Glu, Asn, Asp 

Asp, Lys, Asn, Gin, His, Arg, Glu 
Phe, He, Val, Leu, Met 
Trp 

JABLEJI 



More Preferred Groups of Synonymous Amino Acids 

• ™ Synonymous Group 



Amino Acid 
Ser 
Arg 
Leu 
Pro 
Thr 
Ala 
Val 
Gly 
lie 
Phe 
Tyr 
Cys 
His 
Gin 
Asn 
Lys 
Asp 
Glu 
Met 
Trp 



Synonymous Group 
Ser 
His, Lys, Arg 
Leu, He, Phe, Met 
Ala, Pro 
Thr 

Pro, Ala 
Val, Met, He 

He, Met, Phe, Val, Leu 
Met, Tyr, He, Leu, Phe 
Phe, Tyr 
Cys, Ser 
His, Gin, Arg 
Glu, Gin, His 
Asp, Asn 
Lys, Arg 
Asp, Asn 
Glu, Gin 

Met, Phe, He, Val, Leu 
Trp 

TABLE HI 



Amino 

Ser 

Arg 

Leu 

Pro 

Thr 

Ala 

Val 

Gly 

lie 

Phe 

Tyr 

Cys 

His 

Gin 



Most Preferred Groups of Synonymous Amino Acids 

A ™J synonymous Group 



Ser 
Arg 

Leu, He, Met 

Pro 

Thr 

Ala 

Val 

Gly 

He, Met, Leu 

Phe 

Tyr 

Cys, Ser 

His 

Gin 
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Asn 
Lys 
Asp 
Glu 
Met 
Trp 



Asn 
Lys 
Asp 
Glu 



Met He, Leu 
Met 



Examples of production of amino acid substitutions in proteins which can be used for 
obtaining muteins of UBP8rp, polypeptides for use in the present invention include any known 
method steps, such as presented in US patents 4,959,314, 4,588,585 and 4,737,462, to Mark et 
ai; 5,116,943 to Koths et aL, 4,965,195 to Namen et al; 4,879,111 to Chong et al; and 5,017,691 
to Lee etal; and lysine substituted proteins presented in US patent No. 4,904,584 (Shaw etal). 

Preferably, the muteins of the present invention exhibit substantially the same biological 
activity as the UBP8RPb polypeptide to which it corresponds. 

In other embodiments, UBP8rp polypeptides do not exhibit the biological activity as the 
UBP8RPb polypeptide to which it corresponds. Other uses of the polypeptides of the present 
invention include, inter alia, as epitope tags, in epitope mapping, and as molecular weight 
markers on SDS-PAGE gels or on molecular sieve gel filtration columns using methods known 
to those of skill in the art. Such polypeptides can be used to raise polyclonal and monocl onal 
antibodies, which are useful in assays for detecting UBP8rp expression, or for purifying UBP8rp. 
As a matter of example, a further specific use for UBPSrp polypeptides is the use of such 
polypeptides the yeast two-hybrid system to capture UBP8rp binding proteins, which are 
candidate modulators according to the present invention, as further detailed below. 

3. Vectors, host cells and host organisms of the present invention 

The present invention also relates to vectors comprising the UBP8rp gene or a UBP8rp 
polynucleotide. More particularly, the present invention relates to expression vectors which 
include the UBP8rp gene or a UBP8rp polynucleotide. Preferably, such expression vectors 
comprise a polynucleotide encoding a UBP8rp polypeptide. 

The term " vector" is used herein to designate either a circular or a linear DNA or RNA 
compound, which is either double-stranded or single-stranded, and which comprise at least one 
polynucleotide of the present invention to be transferred in a cell host or in a unicell ular or 
multicellular host organism. An "expression vector" comprises appropriate signals in the vectors, 
said signals including various regulatory elements, such as enhancers/promoters from both viral 
and mammalian sources that drive expression of the inserted polynucleotide in host cells. 
Selectable markers for establishing permanent, stable cell clones expressing the products such 
as, e.g., a dominant drug selection, are generally included in the expression vectors of the 



invention, as they are elements that .ink expression of the drug se.ection markers to expression 
of the polypeptide. 

Additionally, the expression vector may be a fusion vector driving the express.on of a 
fusion polypeptide between a UBPSrp polypeptide and a heterologous polypeptide. For 
5 example, the heterologous polypeptide may be a selectable marker such as, e.g. a lum.nescent 
protein, or a polypeptide allowing the purification of the fusion polypept.de. 

The polynucleotides of the present invention may be used to, e.g., express the encoded 
polypeptide in a host ceil for producing the encoded polypeptide. The polynucleotides of the 
present invention may further be used to express the encoded polypeptide in a host cel. for 
10 screening assays. Screening assays are of particular interest for identifying gators and/or 
binding partners of UBPSrp polypeptides as further detailed below. The po.ynudeot.des .of the 
present Invention may also be used to express the encoded polypeptide in a host organ.sm for 
producing a beneficial effect. In such procedures, the encoded protein may be transrentiy 
expressed in the host organism or stably expressed in the host organism. The encoded 
15 polypeptide may have any of the properties described herein. The encoded polypeptide may be 
a protein which the host organism lacks or, alternatively, the encoded protein may augment the 
existing levels of the protein in the host organism. 

,n one embodiment, the expression vector is a gene therapy vector. Viral vector systems 
that have application in gene therapy have been derived from, e.g., adenovira. vectors and 

20 retroviral vectors. 

Another object of the invention comprises a host cell comprising the UBPSrp gene or a 
UBPSrp polynucleotide. Such host cells may have been transformed, transfected or transduced 
with a polynucleotide encoding a UBPSrp polypeptide. Also included are host cells that are 
transformed, transfected or transduced with a recombinant vector such as one of ttose 
25 described above. The cell hosts of the present invention can comprise any of the 
polynucleotides of the present invention. 

Any host cell known by one of skill in the art may be used. Preferred host cells used as 
recipients for the polynucleotides and expression vectors of the invention include: 

a) Prokaryotic host cells: Escherichia coli strains (I.E.DH5-oc strain), Baciilus suhbLs, 
30 Sa/mone//a typhimurtum. and strains from spec.es like Pseudomonas, Streptomyces 

and Staphylococcus. 

b) Eukaryotic host cells: CHO (ATCC No. CCL-61), HeLa cells (ATCC No.CCL2; 
NO.CCL2.1; NO.CCL2.2), Cv 1 cells (ATCC No.CCL70), COS cells (ATCC NO.CRL1650; 
No CRL1651) Sf-9 cells (ATCC NO.CRL1711). C127 cells (ATCC No. CRL-1804), 3T3 (ATCC 
35 No "l ) human kidney 293. (ATCC No. 45504; No. CRL-1573, BHK (ECACC No. 
84100501; No. 84111301), Saccharomyces cerevisiae strains such as AH109 and Y184, and 
Aspergillus niger strains. 
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Another object of the invention comprises methods of making the above vectors and 
host cells by recombinant techniques. Any well-known technique for constructing an expression 
vector and for delivering it to a cell may be used for construction and delivering the vectors of 
the present invention. Such techniques include but are not limited to the techniques detailed in 
5 the examples. 

Another object of the present invention is a transgenic ani mal which includes within a 
plurality of its cells a cloned recombinant UBP8rp polynucleotide. The terms "transgenic 
animals" or "host animals" are used herein to designate animals that have their genome 
genetically and artificially manipulated so as to include one of the nucleic acids according to the 

10 invention. The cells affected may be somatic, germ cells, or both. Preferred animals are non - 
human mammals and include those belonging to a genus selected from Mus (e.g. mice), Rattus 
(e.g. rats) and Oryctogalus (e.g. rabbits) which have their genome artificially and genetically 
altered by the insertion of a nucleic acid according to the invention. In one embodiment, the 
invention encompasses non -human host mammals and animals comprising a recombinant 

15 vector of the invention or a UBP8rp polynucleotide disrupted by homologous recombination with 
a knock out vector. 

In a preferred embodiment, these transgenic animals may be good experimental models 
in order to study diverse pathologies related to UBP8rp function. In particular, a transgenic 
animal wherein (i) an antisense mRNA binding to naturally occurring UBP8rp rnRNAs is 
20 transcribed; or (ii) an mRNA expressing a UBP8rp polypeptide; may be a good animal model for 
psoriasis and/or other chronic inflammatory diseases. 

4. Methods of making the polypeptides of the present invention 

The present invention also relates to methods of making a UBP8rp polypeptide. 

25 In one embodiment, the UBP8rp polypeptides of the present invention are isolated from 

natural sources, including tissues and cells, whether directly isolated or cultured cells, of 
humans or non-human animals. Soluble forms of UBPSrp may be isolated from body fluids. 
Methods for extracting and purifying natural membrane spanning proteins are known in the art, 
and include the use of detergents or chaotropic agents to disrupt particles followed by, e.g., 

30 differential extraction and separation of the polypeptides by ion exchange chromatography, 
affinity chromatography, sedimentation according to density, and gel electrophoresis. The 
method described in Example 4 may for example be used. Polypeptides of the invention also 
can be purified from natural sources using antibodies directed against the polypeptides of the 
invention, such as those described herein, in methods which are well known in the art of protein 

35 purification. 

In a preferred embodiment, the UBP8rp polypeptides of the invention are recombinantly 
produced using routine expression methods known in the art. The polynucleotide encoding the 



desired po.ypep.ide Is operebly .inked to a promoter into an expression vector <^ 
convenient host Both eukaryotic and P rokaryo«c host systems may be used n tormmg 
rocomHnant poiypep.des. The porypepdde is men fcolated from iysed oeiis „. . «* » torm 
is produced, from the culture medium and purified to the extent needed for ,ts mtended use, 
, Consaquently, a ntoher embodiment o, toe present invenkon is a metood of making a 

polypeptide of the present Invention, said method comprising the ste ps of. 

a) obtaining a polynucleotide encoding a UBP8rp polypeptide; 

b) inserting said pctynucleotlde in an expression vector such that the potynucleokde 
is operably linked to a promoter; and 

10 c, introducing said expression vector Into a host oel, thereby said host cell 

produces said polypeptide. ..... „. 

,„ a prefenad embodiment, the method further comprises the step of rsoiaimg the 
pciypeptlde. The skilled person * appreciate that any step of this metood may be earned cut 

aspect of this embodiment, said po^nucieodde is a polynucleotide comprise SEQ ID NO. 2 o r 

^TnlrLpec, o, toe invent reiatos to a metood of maktog a poiypep.de. said 
» rr«toodcon^ngtoestepsofcu.terir*ahos^ 

a UBPSrp poiynucieotide under conditione suitobie for toe product of a UBPSrp polypept.de 
v»n said host ceU. in a prefenred embodiment toe metood further comprises toe step of 
Dur'rivina said polypeptide from the culture. 

" ^n ale^ embodiment « is often advantageous to add to 
25 polynucleotide encoding a UBPSrp peptide addKiona, nuclide sequent, nM .codes tor 
sJrotory or leader sequences, pro-sequences, sequences v*ich aid in punfteatxm, such as 
^N-*. residues or GST togs, or an add«ona. sequence for stob^ dunng 
rocombinan. produc«on. Soluble portions of the UBPSrp polypeptide may be, e.g.. linked to an 
Ig-Fopartinordertogeneratestablesolublevariante. 

A polypeptide of this invention can be recovered and purified from recombinant cell 
cultures by vLiHtnown metoods inoiudin, but no, limited to dUterentie, extraction, a— 
suifato or etoano, predion, acid exkadion, anion or caaon exchange chromatography, high 
pLroance liquid drromatography, phosphocsiluiose chromategrephy. hydrophobrotnterackon 
drromatography. a«y ohromatogrephy, hydroxyiapakte dtromatography. 
36 immunochromatography and lectin chromatography. 

The expressed UBPSrp polypspkde may be purified using any standard 
^unochromatogrephy techniques, in such procedures, a solution containing the polypepkde 
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of interest, such as the culture medium or a cell extract, fs applied to a column having antibodies 
against the polypeptide attached to the chromatography matrix. The recombinant protein is 
allowed to bind the immunochromatography column. Thereafter, the column is washed to 
remove non -specifically bound proteins. The specifically bound secreted protein is then 
5 released from the column and recovered using standard techniques. 

5. Antibodies of the present invention 

The present invention further relates to antibodies that specifically bind to the UBP8rp 
polypeptides of the present invention. More specifically, said antibodies bind to the epitopes of 
10 the polypeptides of the present invention. The antibodies of the present invention include IgG 
(including lgG1, lgG2, lgG3, and lgG4) t IgA (including lgA1 and lgA2), IgD, IgE, or IgM, and 
IgY. The term "antibody" (Ab) refers to a polypeptide or group of polypeptides which are 
comprised of at least one binding domain, where a binding domain is formed from the folding of 
variable domains of an antibody compound to form three-dimensional binding spaces with an 
15 internal surface shape and charge distribution complementary to the features of an antigenic 
determinant of an antigen, which allows an immunological reaction with the antigen. As used 
herein, the term "antibody" is meant to include whole antibodies, including single -chain whole 
antibodies, and antigen binding fragments thereof. In a preferred embodiment the antibodie s 
are human antigen binding antibody fragments of the present invention include, but are not 
20 limited to, Fab, Fab' F(ab)2 and F(ab , )2, Fd, single-chain Fvs (scFv), single-chain antibodies, 
disulfide-linked Fvs (sdFv) and fragments comprising either a V L or V H domain. The antibodies 
may be from any animal origin including birds and mammals. Preferably, the antibodies are 
from human, mouse, rabbit, goat, guinea pig, camel, horse or chicken. The present invention 
further includes humanized monoclonal and polyclonal antibodies, which specifically bind the 
25 polypeptides of the present invention. 

Preferred antibodies of the present invention recognize an epitope within amino acids 
467 to 482 of SEQ ID NO: 3, wherein said one or more amino -acids are required for binding of 
the antibody to a UBP8rp polypeptide. 

A preferred embodiment of the invention is a method of specifically binding an antibody 
30 of the present invention to a UBP8rp polypeptide. This method comprises the step of contacting 
the antibody of the present invention with a UBP8rp polypeptide under conditions in which said 
antibody can specifically bind to said polypeptide. Such conditions are well known to those 
skilled in the art. This method may be used to, e.g., detect, purify, or activate or inhibit the 
activity of UBP8rp polypeptides. 
35 The invention further relates to antibodies that act as modulators of the polypeptides of 

the present invention. Preferred antibodies are modulators that enhance the binding activity or 



5 



10 



thebiCogicaUctivitycfthe UBPSrp polypeptide to which they bind. These antibodies .ay aot 
as modulators for the biological activity of the UBPSrp polypept.de. 

6 Uses of * h» nolvpeDtide " present Invention, 
' lilnUnLon^^to^euseo. a UBPOrp petypepWe as a for 

pf £ Zeroes * a UBP8* polype. As use, here,, a - 
J i H»t increases or decreases the activity of a UBPSrp polypept.de and/or to a 

*Z2 « 2T- - — - - - — • mRN * * tem 

•mutator- encompasses both agonists and antagonists. 

As used herein, a 'te^ltf refers to a compound mat decreases £ 
UBPSrp polypeptide and/or to a compound that decreases me expression leva. .b«* 
2^22 said po,ypep«de. The ferms -antagonisT and -inhibitor- are ordered to he 
synonymous and can be used Merchangeabty throughout the disclosure. 

AsTsed herein, a -UffiSm^ re.** to a compound that tncreases me a** of a 
UBPSrp polypeptide and/or to a compound that incases ft. express-on leva, of the UBPAp 
.Z encolg sa* po^pepede. The te rms -agonist- and -aCvato, are cnsrdered to be 
svnonymousandcsnbeusedinterchangeablythroughoutthedisclosure. 
, ^Zdsmatcanbeusedfcr^ngmodu^orsformelra^tW^ increase ^ acreage 
acMty of a UBP8RP prxypeptlde or to Ktcrease or decrease the express™ of a WWP 
2!a a. we. Kno»n » the ad and ftalher deWed betov, These assays can be performed 

eccording to the present invendon ^uda £ 
« ™«hetic compounds Such compounds Include, e.g.. natural Uganda, small molecules. 

^Lnd- refers » any agnaiing motecute that binds to a phosphate compnsmg 
P^t^o and Includes merles su* as, e. B „ lipids, nucleotides, polynudeoddes^ 
22£ Pep«des polyposes, proteins, carbohydrates and technic molecules. As used 
„ 2^2SJ2^ re-ers te an o^anlc compound. As user, herein, the t^nr 

at^ds to an antigen. As used herein, the term -antisense «W retera « "* m0leCU ' e 
22122T— nam* pmcessed into mRNA and tran,ated. or elementary 
2221*. As used herein, the term -.to* «•» » an arWcta. nude* acid gan 
35 and Szostex (1990, Nature 346:8,8-822). As used her* fte term ^ 

^^g^-toad^^ 

gene silencing (see, e.g., Bbashiretal. (2001) Nature. 411:49*498). 
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Such candidate compounds can be obtained using any of the numerous approaches in 
combinatorial library methods known in the art, including, e.g., biological libraries, spatially 
addressable parallel solid phase or solution phase libraries, and synthetic library methods using 
affinity chromatography selection. The biological library approach is generally u sed with peptide 
libraries, while the other four approaches are applicable to peptide, non -peptide oligomers, 
aptamers or small molecule libraries of compounds. 

One example of a method that may be used for screening candidate compounds for a 
modulator is a method comprising the steps of: 

a) contacting a UBP8rp polypeptide with the candidate compound; and 

b) testing the activity of said UBP8rp polypeptide in the presence of said candidate 
compound, 

wherein a difference in the activity of said UBP8rp polypeptide in the presence of said 
compound in comparison to the activity in the absence of said compound indicates that the 
compound is a modulator of said UBP8rp polypeptide. 

Alternatively, the assay may be a cell -based assay comprising the steps of: 

a) contacting a cell expressing a UBP8rp polypeptide with the candidate compound; 
and 

b) testing the activity of said UBPSrp polypeptide in the presence of said candidate 
compound, 

wherein a difference in the activity of said UBP8rp polypeptide in the presence of said 
compound in comparison to the activity in the absence of said compound indicates that the 
compound is a modulator of said UBP8rp polypeptide. 

The modulator may be an inhibitor or an activator. An inhibitor may decrease UBP8rp 
activity by, e.g., 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or 100% compared to 
UBP8rp activity in the absence of said inhibitor. An activator may increase UBP8rp activity by, 
e.g., 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or 100% compared to UBP8rp 
activity in the absence of said activator. 

The modulator may modulate any activity of said UBP8RP polypeptide. The modulator 
may for example modulate UBP8rp mRNA expression within a cell, modulate the enzymatic 
activity of the UBP8rp polypeptide, or modulate binding of the UBP8rp polypeptide to its natural 
binding partners. Preferably, the activity of the UBP8RP polypeptide is assessed by measuring 
the ubiquitination state of proteins. 

In one embodiment, the activity of a UBP8rp polypeptide is assessed by measuring the 
ubiquitin-conjugation and/or de-ubiquitination of proteins. Assays for measuring the ubiquitin- 
conjugation and/or de-ubiquitination of proteins are known by those of skill the art. Such assays 



are described, e. fl .. by Navigllo e. a.. 0998. EMBO J. 17:3241-3250, end by Gnesutte e« a>. 
(2001 J Biol Chem. 276:39448-39454). 

,„ a prefened embodiment, foe adivity of a UBP8rp po.ypep.de Is assessed by 
measudng foe WtM * ^^ MS ' ^ <****^ " » 

, ZZ P^ypeP"* •» » ^ te * 9 UBP8 * a UBP8,P ^ " 

ubiauiUnatlon assay described at page 3248 of NavtgBo et at. (1998). 

Corner embedment, ate acflvfly of a UBPBrp Polypep.de ,s assessed by measunng 
the de-ublqulttnatlng activity of UBP8 In the presence of said UBPBrp polypeptide . 

nafutfoer preferred embodiment, the activity of a UBP8RP polypeptide is assessed by 
in measudng foe UBP8n> mRNA levels within a ceil. In .his embodiment, ate activity can for 
IZ be measured using Northern blofo. RT-PCR. duanfoaflve RT-PCR v^-nd 
probes spedlic for UBPBRP mRNAs. Alternately, ate express™ of rite " 
Lsured a. the polypeptide level, by using labeled antibodies foa. specfically bind to Jh. 
measurea an h ass a V s or RIA assays, Western blots or 

UBP8tp polypeptide in immunoassays such as ELISA assays, or ki« as y 

15 immunohistochemical assays. 

Modulators of UBP8.B polypeptides, which may be found. e.g., by any of the above 
screenings are candidate dregs for the treatment of a chrenlc Inflammatory disease. Thus . 
pXTemboolmen, of ate presen, invenflon Is foe use o, a UBPBrp polypeptide - , a Uage. 
I screening candidate compounds for candidate drugs for foe treatment of a chrenlo 

mflammatlon of a tissue or an o^an of an Individua. Chronic intemmatery diseases ^clude, 
e g psoriaste, psoriatic arthrtfls. rheumatoid arthritis, asthma, inflammatory bowel d,sease and 
muWplescleresis.Preferebly. said chrenic inllammatery disease Is psonasts 
* A further aspect of the present invent is foe use o, a modulator of a UBPBre 

polypepflde for screening for dn*s for foe treatment o, a chronic Inflammatory 

o, a mefood foe. can he used for screens for dregs for foe .reatmen. o. - * chrente 
inflammatoo- disease and/or for assessing foe efficiency of an modulator of a UBP8n> 

30 o^dtlistertng said modulafor te an animal mode, for said chrenlc inflammatory d^ease 
a detelnaflon foa, safo modufotor amelfora.es a represenfoflve charactensbc ^ 
lonfo inflammatoo disease in said artmal model indicates foa. sa id modulafor is a drug for 
Z TfoLmen, o, 1 chrenfo inflammatory dfcease. Preferebly. said chronic inflammatory 

35 for chronic Inflammatory dfceases and assays for detemaning whether a 

compound ameiioretes a represented chareoterisflo of foe chrenic inflammatory d,sease ,n 
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said animal model are known by those of skill in the art. A preferred animal model for psoriasfc 
is the SCID-hu mouse that is described in Zollner et al. (2002, J Clin Invest. 109:671 -679). 

Determining whether the modulator ameliorates a representative characteristic of . 
chronic inflammatory disease may be performed using several methods available in the art 
Specifically, when studying psoriasis, the representative characteristic may be the Nation* 
Psoriasis Foundation Psoriasis Score (NPF-PS), the Psoriasis Area Severity Index scor. 
(PASI), or Physician's Global Assessment score (PGA) (see, e.g., Gottlieb et al. (2003) J Drug 
Dermatol. 2:260-266). 

In one preferred embodiment of the present invention, the representative characteristic I: 
the Psoriasis Area and Severity Index score. The Psoriasis Area and Severity Index is i 
measure of overall psoriasis severity and coverage (Fredriksson et al. (1978) Dermatologies 
157:238-244). It is a commonly used measure in clinical trials for psoriasis treatments. 

In a further embodiment, a determination that a modulator of a UBPSrp polypeptid. 
ameliorates the PASI score of an animal model for psoriasis indicates that said modulator is < 
drug for the treatment of psoriasis. Preferably, a 50%, 60%, 70%, 75%. 80%, 85%, 90%, 95<>/, 
or greater improvement in PASI scores indicates that said modulator is a drug for the treatmen 
of psoriasis. Most preferably, a 75% or greater improvement in PASI scores (PASI 75) indicate 
that said modulator is a drug for the treatment of psoriasis. 

A further aspect of the present invention is directed to trie use of a modulator of s 
UBPSrp polypeptide for preparing a medicament for the treatment of a chronic inflammaton 
disease. Such a medicament comprises said modulator of a UBPSrp polypeptide in combinatior 
with any physiologically acceptable carrier. Physiologically acceptable carriers can be preparec 
by any method known by those skilled in the art. Physiologically acceptable carrie rs include bu 
are not limited to those described in Remington's Pharmaceutical Sciences (Mack Publishing 
Company, Easton, USA 1985). Pharmaceutical compositions comprising a modulator of s 
UBPSRP polypeptide and a physiologically acceptable carrier can be for, e.g.. intravenous 
topical, rectal, local, inhalant, subcutaneous, intradermal, intramuscular, oral, intracerebral anc 
intrathecal use. The compositions can be in liquid (e.g., solutions, suspensions), solid (e.g., 
pills, tablets, suppositories) or semisolid (e.g., creams, gels) form. Dosages to be administerec 
depend on individual needs, on the desired effect and the chosen route of administration. 

Such a medicament comprising (i) a UBPSrp modulator, or (ii) a gene therapy vector ol 
the invention may be used in combination with any known drug for the treatment of a chronic 
inflammatory disease. For example, when treating psoriasis, the modulator may bi 
administered in combination with Raptiva. Tazarotene, Anapsos. Alefacept, Micanol, Efalith 
35 Olopatadine, Calcipotrioi, Cyclosporin A, Halobetasol propionate, Halometasone, Acitretin, 
GMDP. Silkis, Betamethasone mousse, Clobetasol propionate foam, Tacalcitol and/o. 
Falecalcitriol. 
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The present invention further relates to the use of a UBPSrp polypeptide for screening 
for natural binding partners. Using a UBPSrp polypeptide as a target has a great utility for the 
identification of proteins involved in psoriasis and for providing new intervention points in the 
treatment of chronic inflammatory diseases. Such methods for screening for natural binding 
partners of a UBP8rp polypeptide are well known in the art One method for the screening of a 
candidate substance interacting with a UBPSrp polypeptide of the present invention comprises 

the following steps : 

a) providing a polypeptide consisting of a UBP8rp polypeptide; 

b) obtaining a candidate polypeptide; 

c) bringing into contact said polypeptide with said candidate polypeptide; 

d) detecting the complexes formed between said polypeptide and said candidate 
polypeptide. 

In one embodiment of the screening method defined above, the complexes formed 
between the polypeptide and the candidate substance are further incubated in the presence of a 
; polyclonal or a monoclonal antibody that specifically binds to the UBPSrp polypeptide. 

In a particular embodiment of the screening method, the candidate is the expression 
product of a DNA insert contained in a phage vector (Parmley and Smith (1988) Gene. 73:305- 
318). Specifically, random peptide phage libraries are used. The random DNA inserts encode 
for polypeptides of 8 to 20 amino acids in length (see, e.g., Oldenburg et al. (1992) Proc Natl 
Acad Scl U S A. 89:5393-5397; Valadon et al. (1996) J Immunol Methods. 197:171-179). 
According to this particular embodiment, the recombinant phages expressing a polypeptide that 
binds to the immobilized UBPSrp polypeptide is retained and the complex formed between the 
UBPSrp polypeptide and the recombinant phage may be subsequently immunoprecipitated by a 
polyclonal or a monoclonal antibody directed against the UBP8rp polypeptide. 

In a further particular embodiment of the screening method, the binding partners are 
identified through a two-hybrid screening assay. The yeast two-hybrid system is designed to 
study protein-protein interactions in vivo (Fields and Song (1989) Nature. 340:245-6), and relies 
upon the fusion of a bait protein to the DNA binding domain of the yeast Gal4 protein. This 
technique is also described in US Patent Nos. 5,667,973 and 5,283,173. The general procedure 
of library screening by the two-hybrid assay may for example be performed as described by 
Fromont-Racine et al. (1997, Nat Genet 16:277-282), the bait polypeptide consisting of a 
UBPSrp polypeptide. More precisely, a UBPSrp polynucleotide is fused in frame to a 
polynucleotide encoding the DNA binding domain of the GAL4 protein, the fused nucleotide 
sequence being inserted in a suitable expression vector, for example pAS2 or pM3. 

In a further particular embodiment of the screening method, the binding partners are 
identified through affinity chromatography. The UBPSrp polypeptide may be attached to the 
column using conventional techniques including chemical coupling to a suitable column matrix 
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(e.g. agarose, Affi Gel®, etc.). In some embodiments of this method, the affinity column 
contains chimeric proteins in which the UBP8rp polypeptide, or a fragment thereof, is fused to 
glutathion S transferase (GST). A mixture of cellular proteins or pool of expressed proteins as 
described above is applied to the affinity column. Polypeptides interacting with the UBP8rp 

5 polypeptide attached to the column can then be isolated and analyzed, e.g., on 2-D 
electrophoresis gel as described in Ramunsen et a/., (1997, Electrophoresis, 18:588-598). 
Alternatively, the proteins retained on the affinity column can be purified by electrophoresis - 
based methods and sequenced. 

In a further particular embodiment of the screening method, the binding partners are 

10 identified through optical biosensor methods (see, e.g., Edwards and Leatherbarrow, 1997). 
This technique permits the detection of interactions between molecules in real time, without the 
need of labeled molecules. 

7, Biallelic markers of the present Invention 

15 The present invention Is directed to the use of at least one UBP8rp-related biallelic 

marker selected from the group consisting of the biallelic markers shown below for determining 
whether there is a significant association between said biallelic marker and a chronic 
inflammatory disease: 

Biallelic maker No. Position on SEQ ID NO: 1 Alternative nucleotides 



1 


1199 


A/G 


2 


1262 


C/T 


4 


1444 


G/T 


6 


1490 


A/G 


7 


1505 


G/T 


10 


1630 


A/G 


12 


1680 


A/G 


13 


1895 


A/G 


14 


2180 


A/G 


15 


2449 


A/T 


16 


2721 


G/T 


17 


3127 


A/G 


18 


3137 


err 


19 


3138 


A/G 


21 


3222 


C/G 


22 


3269 


C/T 


23 


3445 


C/T 


24 


3470 


A/G 


25 


3915 


C/T 


26 


3973 


A/C 


27 


4254 


A/G 


28 


4472 


A/T 


29 


4660 


err 


31 


4919 


A/G 


32 


4973 


C/T 


33 


5063 


C/T 



I 



26 

G/T 



34 5065 

39 5407 

*n 5466 A/G 

40 CfT 



37 5088 

38 5090 g£ 
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As used herein, the i^msl^: «**» *> « P^-Phism having two alleles et 
a famy high frequency In toe poputefl*, preferabry a single nucleotide polymorph^. Typtcafly 
toe fluency of the tess common ailele of the blalleiic markers of me present mvenflon has 
b een vaUdated to he greater than 1%, preferably the frequency Is greater fhan 10* more 
preferab* the frequency is a. least 20% (l.e. heterozygosity rate of at least 0.32), even more 
ate frequency is a, teas. 30% (l.e. heterezygosi* rate o, a, lea, 0.42. in the present 
spedflcaflon, me term tJ*«K Is used te refer horn * me W ^ 
Is canytng me polymorphism. As used herein, the tern, - I IPPB^aM hf^ m^er 
refers to a blalleiic marker located in an axon of the UBPOrp gene, , n an rnfron of me UBP8rp 
T m or in regular regions of ate UBPSrp ger». The term - I mm«**4 h,a M ,c metker- 0 
Z^LJLmZ ^ te BlalleHc markers 1, 2. 4, 6, 7. 10, 12-10, 21-30, 31-35 and 37-41 
shown above and further described in Example 3. 

Deterging whether mere is a significant assertion behaeen said hiallelic marke and 
a cbrorflc Mammal disease can be perfomted using any metood we,, Known by 
, ,„ me art For example, the UBPfcp-related WafleOe maricer of me present .nvenflon may be 
genotyped in case and con.ro. populaflons for me inflammatory disease to be studied. The 
l»c frequency o, marker* between cases and controls may be invented using e.g Jhe 
Pearson Ch, squared test. The EM (Expe^flon-Maximizaflon) algorflhm (Excoffler L & Slaflon 
„. 1995 ) may used to eaflmate hap«ypes for flte populaflon under J"**"; 
0 AJtemaflveiy, haplo^pe frequency esflmaflons may be performed by app*ng toe OMNJBUS 
„ke»hood raflo test (POT publicaflon WO 01,031026). The associaflcn fcdween UB ™£*** 
blaflelic markers of the present Invenflon and psodasis may also be performed as descnbed by 

Veale Trateots and embodiments bating to UBPSrp -related bMelic markers of the 
ffi present invenflon. me dtronlc Inflammatory disease * preferably seiected .rare toe group 
consisflng of psoriaste. psodaflo adhrifls. rheumatoid adhdfls, astoma mtarrxnatory bowe, 
dfe ease and mulflple scterosls. Most p^ ra bly,toe*ron,cinflamn^o,yd B ease«Paonas^ 

The present invenflon la further directed to toe use of a. teas, one UBPSrp-reiated 
btafleflc marker of toe present invenflon for diagnosing wheflter an IndMdual suffere from or ,s a. 
30 risk of suffering from a chronic inflammatory disease. 

,n one embodiment, a single blalleiic marker Is used for diagnosing whether an .ndMdual 
suffere from or is at dsk of suffering frem a chronic inflammatory disease by de.ern.nmg toe 



27 

genotype of an individual. In another embodiment, a combination of several biallelic markers 
may be used for diagnosing whether an individual suffers from or is at risk of suffering from a 
chronic inflammatory disease by determining the haplotype of an individual. For example, a two - 
markers haplotype, a three-markers haplotype or a four-markers haplotype may be determined. 

5 As used herein, the term "genotype* refers to the identity of the alleles present in an 

individual or a sample. The term "genotype" preferably refers to the description of both copies 
of a single biallelic marker that are present in the genome of an individual . The individual is 
homozygous if the two alleles of the biallelic marker present in the genome are identical. The 
individual is heterozygous if the two alleles of the biallelic marker present in the genome are 

10 different. 

The term " qenotyping" a sample or an individual for a biallelic marker involves 
determining the specific alleles or the specific nucleotides carried by an individual at a biallelic 
marker. 

As used herein, the term " haplotype" refers to a set of alleles of closely linked biallelic 
15 markers present on one chromosome and which tend to be inherited together . 

Methods for determining the alleles, genotypes or haplotypes carried by an individual are 
well known by those of skill in the art and further detailed below. 

In the context of the present invention, the individual is generally understood to be 

human. 

20 UBP8rp-related biallelic markers 20 and 36 are highly associated with psoriasis, yielding 

p-values inferior to 10" 9 (Veal et al., 2002). Thus a preferred embodiment of the present 
invention is directed to the use of (i) at least one UBP8rp -related biallelic marker of the present 
invention; and (ii) the biallelic marker 20 and/or the biallelic marker 36 for diagnosing whether 
an individual suffers from or is at risk of suffering from psoriasis . 

25 

The present invention is further directed to a method of genotyping comprising the steps 

of: 

a) isolating a nucleic acid from a biological sample; and 

b) detecting the nucleotide present at one or more of the UBP8rp-related biallelic 
30 markers of the present invention . 

Preferably, said biological sample is derived from a single subject. It is preferred that the 
identity of the nucleotides at said biallelic marker is determined for both copies of said biallelic 
marker present in said individual's genome. In a preferred embodiment, the identity of the 
nucleotide at said biallelic marker is determined by a microsequencing assay. Preferably, a 
35 portion of a sequence comprising the biallelic marker is amplified prior to the determination of 
the identity of the nucleotide. The amplification may preferably be performed by PCR. Methods 



of genotypin, are wel, known by those of skU, in the at and any other known protocol -nay be 
STnlo.de present a. a UBP8n>reteted bfcl* marker of ate present .nvenkon .ay 
for example be determined as described in Example 3. 

Imods well-known to arose skilled ,n are art me. may be used for genotyping ,n order 
,o detent bia«e«c pdymorphtems Inotude methods such as, convendona. dot btot analyzes 
confoLkona, potymorphtsn, anatysis <SSGP, (Odta e, a , (1989, K» « A- 
JuSA 86:2766-2770), denatedng gradlen, go, etectr^ores* (DOGE, p«~r. 
Mute. Res. 202:77*3.), he.erodup.ex analysis (l^ssa e. ai. (1993, Mo. EcoL 2^119 129, 
mtem a«. deavage detection (Grompe e. al. (1989) Proo Na9 Acad So, USA. 86.68^,. 

sHe empioys a specialized exonudease-resWant nudeCkle denvanve as desenbed in US 
4 656127 Okgonudeotlde microarrays or sdid-phase capntrable 
oTtynl^r ~Z~« may afco be used (Wen a. aK (2003) Wo^ 
GasuTnterol. 9:1342-1346; Kim e, a,. (2003) Ana. Btodrem. 316:251-258). Preferred me^ods 
,s ^Tdkedfy determining .he IdemHy o, the nudeokde presen. a. a bialielio marker s,te by 
Cndng assay, microseguencktg assay, enzyme-based mismatch deteckon assay, or 

^ source of nuddc acids, in pudked or non-punned fom, can be u««zed as ate ^g 
30 nuLc add provided » contains or is suspected of containing me specdic nudetc acrd 
s^oeH.. D^orP^Amaybeexb.o te dfn m ^s.6ssues.body fl uk J sandme.,ke 

^Lhod. of geno*pkrg M use k, e.g., m genobyp.ng 

assodakon stadias as weH as In genotypkrg .ndividuals in ate context o. deteckon of alle.es o, 
S«ota2s whtoh are known <o be assodated wkh a given trak. In the context of ft. 

psoriasis, psonakc arftdds, rheumatoid arftnus. asftma. Inflammatory bowe, drsease and 
muWpie sclerosis, and most preferably psoriasis. 

,n one embodiment ate above genotyping meftod further compnses ate step of 
conetakng ate resuk of ate geno.yp.ng steps wkh a rtsk of suffering from a chronic kdlammatery 

30 *""*. presen. kwendon ia further direded ,o ate use of a. leas. 

b ,a.te„c madter o, ate present kwendon tor determining kie haptocype of ar , ,n*«dua, ^ When 
determining ate hapkrtype of an individual each slngte chromosome should be started 
Zlrdly. Memods of determining ate haplotype of an individua. are well known 

35 at to .ude. e.g.. asymmeftc PGR amplkfoakon (Newten e. a,. (1989) Nudeic Adds Res. 
VJZs 6; WU e. a.. (1989, ProcNad. Acad. Sd. USA. 86:2757 -2760). Isolation of stngle 
121 by Iknlted dkulton fokowed by PGR ampMcadon (Ruano e, al. (1990, Proo. Nad. 
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Acad. Sci. USA. 87:6296-6300) and, for sufficiently close biallelic markers, double PCR 
amplification of specific alleles (Sarkar and Sommer, (1991) Biotechniques. 10:436-440). 

Thus the present invention is further directed to the use of at least one UBP8rp-related 
biallelic marker of the present invention for determining the haplotype of an individual. For 

5 example, a method for determining a haplotype for a set of biallelic markers in an individual may 
comprise the steps oft a) genotyping said individual for at least one UBP8rp -related biallelic 
marker, b) genotyping said individual for a second biallelic marker by determining the identity of 
the nucleotides at said second biallelic marker. In one embodiment, both markers are UBP8rp - 
related biallelic markers of the present ivention. In another embodiment, one marker is a 

10 UBP8rp related marker of the present invention and the other biallelic marker is biallelic marker 
20 or 36. 

Methods of determining a haplotype for a combination of more than two biallelic markers 
comprising at least one UBP8RP -related biallelic marker of the present invention in an individual 
are also encompassed by the present invention. In such methods, step (b) is repeated for each 
15 of the additional markers of the combination. Such a combination may comprise, e.g., 3, 4 or 5 
biallelic markers. 

When estimating haplotype frequencies in a population, one may use methods without 
assigning haplotypes to each individual. Such methods use a statistical method of haplotype 
determination. Thus another aspect of the present invention encompasses methods of 

20 estimating the frequency of a haplotype for a set of biallelic markers in a population, comprising 
the steps of: a) genotyping each individual in said population for at least one UBP8RP -related 
biallelic marker, b) genotyping each individual in said population for a second biallelic marker by 
determining the identity of the nucleotides at said second biallelic marker; and c) applying a 
haplotype determination method to the identities of the nucleotides determined in steps a) and 

25 b) to obtain an estimate of said frequency. Such a method may also be performed for a 
combination of more than 2 biallelic markers. Step (c) may be performed using any method 
known in the art to determine or to estimate the frequency of a haplotype in a population. 
Preferably, a method based on an ejspectation -maximization (EM) algorithm (Dempster et al. 
(1977) JRSSB, 39:1-38; ExcofRer and Slatkin, (1995) Mol Biol Evol. 12:921-7) leading to 

30 maximum-likelihood estimates of haplotype frequencies under the assumption of Hardy - 
Weinberg proportions (random mating) is used for performing step (c). 

Having now fully described this invention, it will be appreciated by those skilled in the art 
that the same can be performed within a wide range of equivalent parameters without departing 
35 from the spirit and scope of the invention and without undue experimentation. 

While this invention has been described in connection with specific embodiments thereof, it 
will be understood that it is capable of further modifications. This application is intended to cover 
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any variations, uses or adaptations of the invention following, in general, the principles of the 
invention and including such departures from the present disclosure as come within known or 
customary practice within the art to which the invention pertains and as may be appl.ed to the 
essential features hereinbefore setforth asfollows in the scope of the appended ohm. 

All references cited herein, including journal articles or abstracts, published or unpublished 
patent application, issued patents or any other references, are entirely incorporated by reference 
herein, including all data, tables, figures and text presented in the cited references. Additionally the 
entire contents of the references cited within the references cited herein are also entirely 

incorporated by reference. . „ 

Reference to known method steps, conventional methods steps, known methods or 
conventional methods is not any way an admission that any aspect, description or embodiment of 
the present invention is disclosed, taught or suggested in the relevant art. 

The foregoing description of the specific embodiments will so fully reveal the genera, nature 
of the invention that others can, by applying knowledge within the skill of the art (including the 
contents of the references cited herein), readily modify and/or adapt for various application such 
specific embodiments, without undue experimentation, without departing from the general concept 
of the present Mention. Therefore, such adaptations and modifications are intended to be wrth.n 
the meaning and range of equiva,ents of t. disclosed embodiments, based on the teachmg and 
guidance presented herein. It is to be understood that the phraseology or terminology here.n . for 
the purpose of description and not of limitation, such that the terminology or phraseology of the 
present specification is to be interpreted by the skilled artisan in light of the teachings and gurtance 
presented herein, in combination with the knowledge of one of ordinary skill in the art. 
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EXAMPLES 



Example 1: Identification of the UBP8rp gene 

5 1. Isolation of the UBP8m mRNA 

RT-PCRs were performed on polyA+ RNAs from Stratagene (Reference Nos 778021 
and 778022). cDNA was synthesized with the help of Rt-for-PCR kit (Clontech) with oligo(dT) 
and random primers for each RNA sample. The cDNA quantity obtained for the reactions varied 
between 0,6 and 1,5 ug of cDNA per reaction. A PCR reaction was performed using 5 |xl out of 

10 100 |il of the RT-PCR samples, primers of SEQ ID Nos. 5 and 6, and the rTth enzyme. The 
cycling was as follows: 94°C 5min ; 94°C 20sec,67°C 3min - 32 cycles ; 72°C 10min. 

The first PCR reaction was diluted five fold and 2% thereof was used for performing a 
nested PCR reaction with primers of SEQ ID Nos. 7 and 8. The cycling conditions wer e identical 
as above. 

15 The resulting product was sequenced using primers of SEQ ID Nos. 9-35. The 

sequencing was carried out on ABI 377 sequencers. The sequences of the amplification 
products were determined using automated dideoxy terminator sequencing reactions with a dye 
terminator cycle sequencing protocol. The products of the sequencing reactions were run on 
sequencing gels and the sequences were determined using gel image analysis (ABI Prism DNA 

20 Sequencing Analysis software (2.1 .2 version)). 

The cDNA comprised the Open Reading Frame of SEQ ID NO: 2. This Open Reading 
Frame codes for a 482 amino acid long protein, the UBP8rp protein (SEQ ID NO: 3). 

2. Identification and annotation of the UBP8m gene 

25 The genomic region encoding the UBP8rp protein was identified using bioinformatic 

tools. The UBP8rp gene is shown as SEQ ID NO: 1. This gene is located within the 10 -kb major 
region for susceptibility for psoriasis that was Identified by Veal et al. (2002). The UBPSrp gene 
comprises two introns located at nucleotide positions 1018 to 1046 of SEQ ID NO: 1 and 1676 
to 1718 of SEQ ID NO: 1 (see Figure 1). 

30 Thus it has unexpectedly been found that a novel expressed gene is located within the 

10-kb major region for susceptibility for psoriasis. In prior art litterature, the gene encoding 
UBP8rp was annotated as a silent pseudogene element not comprising any Open Reading 
Frame. 
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3 W y aa of the l IRp lta> protein 

The UBPBrp protein shows eignificant homotofly to the UBP8 ubiqu,«n ,sopept,dase 
Wheh comparing UBPBrp to UBP8 using the BLAST version 2.0 program (Altachul « al 1990 J 
, I, 8^5-403-410* UBPBtp Is found to be 74% Identioal to UBP8 (Rgure 2). More 
^aT J 6 rto4 6 :ofUBP8r P showhomo to g»toam I noao M s7 8 to492o fU BP8 

(81% of Idem** ^ ^ ^ ^ ^ ^ ^ e , a , 1999 proteln 

Engineering 12:3-9. von Hel|ne. 1992 J. MaL Biol. 225:487^94). UBP8rp was found to be an 
,„ loeZ proteto. UBP8rp was former ana^ed us,ng fhe HMMER 21/1 = <B^ 
19 96 Current Opinion h Stoicturel Biology 6:361-365). As shown on Rgure 3. UBPDre drspteys 
mod^e* P*- domain a, an.no aold positions ,84 to 433 (score: 32.7; e«e- 
06) The presence of rhodanese-IIRe domains is a common featore to UBPa a, noe UBP8 and »e 
Zl'sopep^ase 7 from SaccWnyces eerevfcfae also display a single rhodaneee4*e 

15 ^Thus UBP8rp a novel member of *e UBP fcmi*. UBP8rp seems to belong to Oto 
uhiquKm-preteasome pathway, and may play a role in the select deorada«on o, rntoacellutor 
proteins. 
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The expression levels of UBP8rp «n aduK akin, fetal skin. testis, braln. adipose tsaue, 
small interne and oolon was detained using commercial totel RNA (Clomec*). In add-on, 
Te^ston levels of UBP8rp to adun sldn was also detonnined using sKn b,ops,es ton 

25 l'H6pftal Pasteur (Paris, France). . /A _ Ki ^„x 

20 pL of commercial total RNA were treated by 4 units of RNase free DN Ase I (Ambton). 
The cDNA was obtolned using too "Advantage RT tor PGR- Kit (Clontoch) <<*ow,ng the 
Z2L preyed by toe supplier. The 0— e PGR was pertormed 
Univerea, PCR Master mix NO AmpErase UNO (Applied Blosystems). The react™ was 
30 perked with 25 ng of cDNA, 300nM o, each primer and WW. - Taqm*, p£e. The 
prcgram applied was: 40 to 50 cycles a. 95'C for 10 minutes; 95'C for secondee, 60 C for 1 

^ The experiments were psrfomted on a 7900HT Applied Biosyatoms 
experiment was performed either with pdmers of SEQ ID Nos. 36-38 or with primers of SEQ ID 
« ZlZ as detailed in table 1 below. The efficiency of the chosen primers were calculated 
" ^'irtoeDserBu.e.nApp.edBlosystomsdOgT - updatod 10*001 , AB. PRISM 7700 
Sequence Detection System. Relative Quantification of Gene Expression. User Bulletm #2 . 
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Table 1 





Primer 
forward 


Primer 
reverse 


Taqman 
MGB probe 


Amplicon 
length 


1 st set of primers 


SEQ ID NO: 36 


SEQ ID NO: 37 


SEQ ID NO: 38 


98 


2™ set of primers 


SEQ ID NO: 39 


SEQ ID NO: 40 


SEQ ID NO: 41 


109 



The expression was calculated as described by Livak & Schmtttgen (2001, Methods 
25:402-408) The Ct is an absolute value indicating the relative expression level of a gene. A Ct 
under 20 Is indicative of a highly expressed gene. A Ct between 35 and 40 is Indicative of a 

—A ACt 

weakly expressed gene. Calculation of the 2 value allows to compare expression levels of 
a gene in a target tissue to be studied and in a reference tissue. 

In order to confirm that the primers specifically amplify UBP8rp and not another gene, 
the ampllcons obtained by quantitative PCR were sequenced with the forward and reverse 
primers used for performing the QPCR. It was found that the cDNA amplified by PCR effectively 
corresponds to the UBP8rp cDNA. 

-AACt 

The results of the quantitative expression analysis are shown In table 2. The 2 
value was calculated using testis as a reference tissue, numerous genes being expressed in 
testis at high levels. 

Table 2 





Primers 


SEQ ID Nos. 36-38 


SEQ ID Nos. 39-41 


Tissue 


efficiency 


Ct 


2 


efficiency 


Ct 


2 


Testis 




36.1 


1.0 




35.4 


1.0 


Adult skin (biopsy) 




36.1 


2.5 




36.6 


1.1 


Adult skin (commercial) 




39.6 


-5.3 




39.5 


-7.8 


Fetal skin 


104% 


36.7 


-1.0 


77% 


37.9 


-3.6 


Brain 


259% 


38.9 


-8.3 


342% 


38.7 


-11.5 


Adipose 




38.3 


-4.7 




38.3 


-7.0 


Small intestin 




38.3 


-2.4 




39.0 


-6.1 


Colon 




37.2 


-1.7 




36.6 


-1.8 



Using commercially available RNA, UBP8rp is found to be significantly expressed in 
testis, foetal skin, and colon, allthough at a low level. In addition, UBP8rp is found to be 
expressed at a very low level in adult skin, brain, ad ipose and small intestin. 

When using RNA from skin biopsies, UBP8rp is found to be significantly expressed in 
adult skin. Specifically, expression of UBP8rp is found to be higher in adult skin than in any 
other tissue, both with primers of SEQ ID Nos. 36-38 and with primers of SEQ ID Nos. 39-41. 



When the expert is perfom.ed using primers SEQ ,D No, 36-38, expression of UBP8rp is 
found to be 2.5-fold higher in adult skin than in testis. 

Examp ,e 3: Iderrtifigtion of «H« Hfi mnrkors located in the UBPSr 

' ^„«fi«H a = ^tailed below (BM Nos. 17-19, 22, 25, 27- 

Fiahteen biallelic markers were identified as detaiiea dbiuw v 
29 31-35^7-Tl). Fifteen bia„e.,o markers were identtted using sequence data provded by 
Celera (BM Nos. 1, 2, 4, 6, 7, 10, 12-16. 21, 23, 24 and 26). 

50 to 100 ng of genomio DMA from iymphoblastoid cel. lines Luoy or ^Boieth (CEPH 
0 TO „ecC ™d to Urm a PCR reason wHh pdmers of SEO , No, 42 and 43. The 
PGR assays were performed using the following protocol: 

. 5 units of Amplitaq enzyme (Perkin-Elmer, 1^808-0101) 
30 Hi of reaction mix with 10X supplied Taq buffer 
15 - 250 hM each dNTP, 

. 15 pM of each primer 72<>c 
. cycling: 94X 10 min, then 30 cycles of 3 steps - 94 C 30sec , 

30sec, then 72X 10min ampl i flca tion primers. The sequences were 

Tne PGR product was r^*Z£Z^~^^^»* 
20 blasted against genomic sequence, and sequence curv 
17-19 and 22 were thus identified. 

50-100 ng of genomic DNA from lymphoblastoid cel. lines U, ro r J»J 
collection) were used to perform a long-range PCR reaction with pnmers of SEQ ID Nos. 44 a 
25 45. The PCR assays were performed using the following protocol: 
. 2 units of rTTh XL enzyme (Perkin-Elmer) 

50 ul of reaction mix with supplied 3.3X buffer an d 200 pM of each dNTP 

- 20 pM of each primer 

- MgOAc. . 4 . ^ 72 ° C 
30 - Cycling: 94*C 5min, then 32 cycles of 2 steps - 94 C 20sec , 
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ri*g prcduc. was sequenced with me help of the Mowing pairs of primers: SEQ 
, D J£ ^ « ID Nos. 48 and 43. and SEC ,0 Nos. 50 and „ . The sequences were 

2 , se" ,0 No, 4S and 4 9 a lto wed ere . of «* nrarKe. Nos. 
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27-29 and 31. Sequencing by SEQ ID Nos. 50 and 51 allowed the identification of biallelic 
markers Nos. 27-29 and 31. 

The alternative alleles of biallelic markers Nos. 1 -41 and their location within the UBP8rp 
gene are indicated in table 3. Biallelic markers Nos. 2-1 1 and 13-14 are located within the CDS 
of UBPSrp. Biallelic markers Nos. 2-8, 10 and 13-14 are coding SNPs. Biallelic markers Nos. 20 
and 36, which are known to be highly associated with psoriasis, are shown in bold letters. 



Table 3 



BM I 
No. 


Internal I 
Designation 


Position 
on SEQ ID 
NO:1 


Exon 
No. 


coding 


A M - * * 

Alternative 
Alleles 


Sequence 

in: 






3oletn 


Lucy 


1 


hCV1 581 9424 


1199 


- 


~ 


A/G 






2 


hCV1 6030280 


1262 


2 


Yes 


C/T 






3 


SNP n.14 


1426 


2 


Yes 


C/G 






4 


hCV1 1691030 


1444 


2 


Yes 


G/T 






5 


SNP n.13 


1487 


2 


Yes 


A/G 




— " — , 


6 


hCV1 6030281 


1490 


2 


Yes 


A/G 






7 


hCV15819434 


1505 


2 


Yes 


G/T 






8 


SNPn.12 


1518 


2 


Yes 


/"% IT f 

C/T 






9 


SNPn.11 


1554 


2 


No 


C/T 






10 


hCV1 5819435 


1630 


2 


Yes 


A/G 






11 


SNP n.10 


1638 


2 


No 


A/T 






12 


hCV1 6030289 


1680 


** 

(splice 
site) 


~* 


A/G 






13 


hCV16030290 


1895 


3 


Yes 












2180 


3 


Yes 


A/G 






15 


hCV1 6030298 


2449 






A/T 






16 


| hCV1 6030299 


2721 






G/T 






17 


S SNPG3127 


! 3127 






A/G 


A/A 


A/G 


18 


SNPG3137 


3137 






C/T 


T/T 


C/T 


19 


SNPG3138 


i 3138 






A/G 


G/G 


G/A 


20 


SNP n.@ 


! 3183 






A/G 


A/A 


A/G 


21 


hCV1 5824895 


3222 






C/G 


G/G 


G/G 


22 


SNPG3269 


I 3269 






C/T 


C/T 


T/l 


23 


hCV1 5824896 


3445 






C/T 






24 


hCV1 6030306 


3470 






A/G 






25 


SNPG3915 


3915 






C/T 


T/T 


C/C 


26 


hCV1 6030307 


I 3973 






A/C 


A/A 


A/A 


27 


i SNPG4254 


I 4254 






A/G 


A/A 


A/A 


?R 


SNPG4472 


4472 






A/T 


A/A 


A/A 


29 


SNPG4660 


4660 






C/T 


C/C 


C/T 


30 


1 SNP n.8 


4770 






A/G 


G/G 


A/A 


31 


! SNPG4919 


4919 






j_ A/G 


G/G 


A/G 


32 


SNPG4973 


! 4973 






C/T 


T/T 


C/T 


33 


SNPG5063 


5063 






C/T 


T/T 


C/C 


34 


\ SNPG5065 


I 5065 




G/T 


G/G 


T/T 
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Table 3 (continued) 
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CLAIMS : 

1 . An isolated gene comprising introns having a sequence of: 

a) nucleotides 1018to 1046 of SEQ ID NO: 1; and 

b) nucleotides 1676 to 1718 of SEQ ID NO: 1. 

2. An isolated polynucleotide complementary to a messenger RNA transcribed from the 
gene of claim 1. 

3. The polynucleotide of claim 2, wherein said polynucleotide comprises SEQ ID NO: 2 or a 
polynucleotide complementary thereto. 

4. An isolated polypeptide encoded by the polynucleotide of any of claims 1 to 3. 

5. The polypeptide of claim 4, wherein said polypeptide is selected from the group 
consisting of: 

a) a polypeptide comprising SEQ ID NO:3; 

b) a polypeptide comprising a span of at least 470 amino acids of SEQ ID NO: 3; 

c) a polypeptide comprising a span of at least 15 amino acids of SEQ ID NO: 3, 
wherein said span falls within amino acids 467 to 482 of SEQ ID NO: 3; 

d) a mutein of any of (a) to (c), wherein the amino acid sequence has at least 95%, 
96%, 97%, 98% or 99% identity to at least one of the sequences in (a) to (c); 

e) a mutein of any of (a) to (c) which is encoded by a nucleic acid which hybridizes 
to the complement of a DNA sequence encoding any of (a) to (c) under highly 
stringent conditions; and 

f) a mutein of any of (a) to (c) wherein any changes in the amino acid sequence are 
conservative amino acid substitutions of the amino acid sequences in (a) to (c). 

6. An expression vector comprising the gene of claim 1 . 

7. An expression vector comprising the polynucleotide of claim 2 or 3. 

8. The expression vector of claim 6 or 7, wherein said polynucleotide encodes the 
polypeptide of claim 5. 

9. The expression vector of any of claim 6 to 8, wherein said vector is a gene therapy 
vector. 

10. A host cell comprising the expression vector of any of claims 6 to 9. 

1 1 . A method of making a polypeptide, said method comprising the steps of culturing a hos t 
cell according to claim 10 under conditions suitable for the production of a polypeptide of 
claim 4 or 5 within said host cell. 
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12. The method of claim 11. further censing the step of purifying said polypeptide from 
the culture. 

1 3 An antibody lhat specifically binds to the polypeptide of claim 4 or S. 

14. use of a polypeptide of claim 4 or 5 as a target tor screening for natural binding partners. 

15 Use of the polypeptide of claim 4 or 5 as a ta^et for screening candidate modulators. 

16. me use o, claim 15. therein said candKate moduiator is selectod from the^oup 
consist, of a natural Hgand. a email molecule, an aptamer, an antsense mRNA. 
small interfering RNA and an antibody. 

17. The use of claim 15 or 16. herein said modulator is a candidate drug for the treatment 

of a chronic inflammatory disease. 
IB.TheuseofanyofdalmeUtolT.^ereln^acdvi^ofsaldp^pepUdeofcteim*^ 

is assessed by measuring the MNM acSvlty of said polypepSde of Ca«n 4 or 



5. 



,9 Tbe use of any of Calms 14 to 17, wherein the activity of said polypeptide of dalm 4 or 5 
is assessed by meas^ng Ihe de-ut*,u,«naung aCvHy o, UBP8 in the presence of sa* 
polypeptide of claim 4 or 5- 

20. Use of a modulator of a polypep«de of dalm 4 or 5 tor preparing a medicament for the 
treatment of a chronic inflammatory disease. 

21. The use of claim 20. wherein said modulator is used in combination with a Known drug 
for said chronic inflammatory disease. 

22. The use of any of Cairns 17 to 21, wherein said chronic inflammatory disease is 
psoriasis. 

23 A method of assessing the efficiency of a modulator of a polypeptde of claim 4 or 5 tor 
»e treabnen, of psoriasis, said me*od composing administering saH modulator to an 
animal model for psoriasis; wherein a determination that said modulator ameliorates a 
represented dtaractori* of psodasls in said animal mode. Scales fha. sa,d 
modulator is a drug for the treatment of psodasls. 

24. The method of dalm 23, wherein said representative characteristic Is a Psodasls Area 
and Severity Index score. 

25 The method of claim 24. wherein a 75% or greater Improvement in Psonasis Area and 
' seventy Index scorns (PASI 75) indicates that said modulator is a dreg forthe treatment 
of psoriasis. 
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26. The method of any of claims 23 to 25, wherein said animal model is a SCID-hu Mouse. 

27. Use of at least one UBP8rp-related biallelic marker selected from the group consisting of 
the biallelic markers shown below for determining whether there is a significant 



association between said biallelic marker and a chronic inflammatory disease: 



Biallelic maker No. 


Position on btu iu nu. i 


Alternative nucleotides 


1 


a a no 
1199 


A/o 


2 


1262 


U/ I 


4 


A A A A 

1444 


G/ 1 


6 


1490 


A/G 


7 


1505 


g/t 


10 


1630 


A/G 


12 


1680 


A/G 


13 


1895 


A/G 


14 


2180 


A/G 


15 


2449 


A tV 

A/T 


16 


2721 


G/T 


17 


3127 


A/G 


18 


3137 


C/T 


19 


3138 


A/G 


21 


3222 


C/G 


22 


3269 


C/T 


23 


3445 


C/T 


24 


3470 


A/G 


25 


3915 


c/r 


26 


3973 


A/C 


27 


4254 


A/G 


28 


4472 


A/T 


29 


4660 


C/T 


31 


4919 


A/G 


32 


4973 


C/T 


33 


5063 


C/T 


34 


5065 


G/T 


35 


5079 


C/T 


37 


5088 


C/G 


38 


5090 


C/T 


39 


5407 


C/T 


40 


5466 


A/G 


41 


5520 


C/T 



28. Use of at least one UBP8rp-related biallelic marker of the table set forth in claim 24 for 
diagnosing whether an individual suffers from or is at risk of suffering from a chronic 
inflammatory disease. 

29. The use of claim 27 or 28, wherein said chronic inflammatory disease is psoriasis. 

10 30. A method of genotyping comprising the steps of: 

a) isolating a nucleic acid from a biological sample; and 
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b) detect the nucleo.de present at ene or mo™ of the UBPSrp -rotated biallellc 
markers of the table set forth in claim 27 . 
„.H» method of dalm 30. wherein aatd biological sample ia derived from e single 
individual. 

32 The method of claim 31 , wherein the identity of the nuc.eo.dea at said MM. marker Is 
Ln^nedfc.c.mcop.eso.sa.dWMIomarirerpresen.lnsaldlnd.vlduarsgenome. 

33. The method of any of claims 30 to 32, when* said determining is performed by e 
microsequencing assay. 

34 The method of any of cfaima 30 to 33. further comprising amplifying e portion of a 
sequence comprising the blallelio marker prior to said determining step. 

35 The method of claim 34, wherein said ampHfying ia performed by PCR. 

36 The method of any of claims 31 to 35, further comprising the step of chelating the 
Z* of me geno^plng steps w»h a risk of suffering from a chronic .nuammatcry 

disease. 

15 37. The method of olaim 36. wherein said chronio inflammatory disease is psoriasis. 



10 



43 



ABSTRACT 



The invention encompasses a novel gene encoding a protein of the ubiquitln- 
proteasome pathway. UBPSrp. The invention also relates the use of UBP8rp polypeptides for 
screening for modulators, and to the use of said modulators for treating chronic inflammatory 
diseases such as, e.g., psoriasis, psoriatic arthritis, rheumatoid arthritis, asthma, inflammatory 
bowel disease and multiple sclerosis. The invention further relates to the use of biallelic markers 
located in the UBP8rp gene for diagnosing said chronic inflammatory diseases. 
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FIGURE 1A 

F ran,e3 L C # S L L F 6 L S L I T M F ft E ^ « P x 
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FIGURE IB 
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FIGURE 1C 



o Ai «, crPl YFTFSQCS*KSHQ 

Frame! » R x l I «/////„* > / P~ k" a' 

K hCVll691030 R SNpl3 G:allele ! 

Frame2 Q E S L L A G L K H T S Q M _ A T w 
CCAGGAGTCACTGCTAGCT^A^G^G^^CCTCC CAGATGATTCTATAGAT ^jfjlf?? 
KhCVl5819434 Y hCVl6030288(SNPl2) T:a11e1el Y SNPll C. 
allele 2 

^"k'-B'B-h'-k'-c-G v;r« ;v77' G v7 L c Q K K R i 

Frame2 R R G G m w :> x w t r p 
SST 1 AAGAAGAGGG6GAATGTGGAGT ATATGGTAcWc^GACTGGTTTAGTTCreCAAAAGAT 1620 

53T el Ac2ga^ggVaZaa^ctgg CATCTG/^AGATGCACTTTTC^/^KTG^C^VAAGTAAA 1680 

R HCV15819435 s456> R 

HCV16030289 splice site ^ hcvl58194 3 6 SNpl0 T:alle le 1 

mes - "n" 'c 'pMrA S'/SSSl'J 1 i £ y " "£ J* "5 V 

Frame2 L S C A M G L W G / F L 6 V F L R c G R L * k l V 

™ ACTGTCCTGTGCAATGGGCCrTGGGCCTTTCGTTTTAGAG^GA 1740 

lm Bl CCTTTGCTATTCrcAGTATACAACAAATGCT AAGOTWCTCCACCCCCAOVACACCAGAA 1800 

<R541 

^iSi! " L £ S L s" D i ^ / s" l" E~ / P^S 

ofir 61 TGAAGAG^GTCTATCTCA^GGATTTrACTTATCCCT I860 

1920 

e< 

Framed" *v" "l" "k" "v" "i"k" "li" "r" ~G Q D H * I Y Q 

Samel s D Q I s J> J> Q N & E R T G P ^ N X S I 

D^r 61 A^CTGAtIa^^^ 1980 




4/13 



FIGURE ID 



sr 1 a^™**™*^^ 2160 

Frame2 V I P D C s t r f v v » » v N R 

^A mel AgStATTCCTGATTGTTCCA^CA^GCCACT 2220 

R HCV160 30297 T/A E2F-1 

TCAAGAAAAGGCT^TXrrWTGCAGAAACTffln'CrTCTAATGGAGAAAAACAAACA'lGA 2280 




DMA " - " W 



K 



F«3*T*Q "k'V'k V'K "6 E A E E N E I T Q K Q Q 
Frame2 A K S Q K E T R s R K K L H R S N k 
Sna^ 1 aIcaL^gccaaLvga^ca^ga AGCTGAAGAAAATGAAATTACACAGAAGCAACAAA 2400 

I END or orr 

Fr^3''ic'V'K"*E""E''M'"E"T'K"'E'"R"y*Q "a K K E D J< E 
FrameZ S K R R N G E E R T " T r G ? Q r E r R r G ^ * H 

aLcaUga^aaat^^ 



Fr^.43 I S A K K G K E I T R V K R Q S K S H # 
SSS S L 0 S Q K R E R A K K * N Q E S K K V K V I M 

55a tctcagcaaagaagggcWgaaataacaagagtaaaaagac^ 



Frmki"k"V'k"G'k"k"K s v e d r g r r c s t p e 
Frame2 N L W C R E V C ^R 6 Q 6 E ^ p M y F q N p P q R ^ 
Framel K _pl v p R A * AC . CCArAC< - cccagaaGATGTTCAACCCCAGAAG 2580 



^'7/7/777^ p" Y T I 5 C A D T R G G° F S rV R 

™ L t Icag^1Vca^^ 

WA mel gcaagccttttaagattaaagga^^ 2700 

Frame2 R G Y R * Y L K K * N S T £ R T l F r D # S ^ 
D^ mel gIgAG^ACaU™^ 2760 

Frames' 'r''s'TT*m 'g"r 'm V 7"g'l'7"s p S L * 

Frame2 K*RNGEDGTRTAFR^- < GQV^^ 
DNA mel GAACTGAAGAMTGGGG^ 2820 
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FIGURE IE 



Frames d P I T C T F H Y Y H S P L T t J J C T ? 

™ fi ^A^^ 2880 



L."l H L H U L P P L Q L I K A S 



SSS " ^ Z » ^ P P ///,V H T P" S P N T S H * K R G a" / „ 

KT 1 ACTCGAAATCGCTCCTTCATCTGCA 2940 

Frame3 H R F L L s k a « » - ■ » K * C S Y S S P 

Frame2 Q I P A K Q d R e k » E M L L L L P 



TT-F--c'-L--s"R-Tv:N-;t';p * • » a l t p s p 

Q I P A K Q P R ^ _T — n T E M L U L L t 
SSr 1 ACAGATTCCTGCTAAGCAS^^ 3000 

^3-^x7777777777?' 7s p n t s v * n sVr 

Frame2 Q A N H L P * " - <- F L L u R F G 
Framel T . S_Q_H_A. J^ AA L^ A LTCTC^GGCTTTCTGCrrcrcAGATrre6A 3120 



"c'Y P "k A E I S R U S A S Q _I W 
4 L P * K S L * R D S L , 
DNA ACAAGC^ACATGCTACCCTAAAGCTGAGATCTCAAGGi 

Frame? K _L _N ^P V p F ^ r U p » w « y- g » g - y w ~ T ~ s * u R _ K 

r-.«,.«~ .. - - L P L 

DNA AACTCAATCCTGTTTTT' 



FraiiZ T Q S C F W R F W . » = • L D F V T * E ^ 
Z™* 1 AArTCAATCCTGTTTTTGGAGGTTCTGGAC^ 3180 

, JwaS* c/t nicy, t/t Boleth 
I SNPG3138 G/A tucy G/G Boleth 

Frame2 Y L L Y E L N I A V P m » x w L j 

Framel I L V I * JQ J tTCcAGT ^GCCTATGTAATCCTCCACATTTGGCTGATT 3240 
A gsnp9- A allele 2 (associated) AG Lucy, £ "ggfoj 

GG LUCy, GG 

boleth 



a^ImIc^^^t;^^^™^ «• 



* 

K 
K 



F^'V'i""s'V'R "R TV'Y V H E S P V D R T g V x p 
Frame2 v K. w Q K M L V S k p k « Q y R 
™ el GGTGAAG TGGG\G AAGAAT^GCTATAAT^TGAAAGCCCCGTGGACAG^ 3360 

Frames- -i"Y-Q J J'/ 7'*' J}' ^''^i' 11'^' 7 s u Q o 
Frame2 I S V Q K T L K S P L, » a p A 6 

D^r 1 tatatcagtcovaaagaccVtaaaotcaJca^ 3420 

Frame2 T V K I H K N F F c s w H £ D L 

Framel Y s Q d s Q e f l l _i- t w A TGGTCXCCATG aagatcta 3480 

DNA TACAGTCAAGATTCACAAGAATTTOTCTCrrCCTAATGGA ^i*- h CVl6030306 
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FIGURE IF 

*RRK**SSQ*L 



G 
R 



Frames K * N • * S E 0 I • R * * " / ^ 5 M T L 
Frame2 I K L I „ R v K E E N N D H L. N D F 

^ mel AaVaI^CTGATA^ 3540 
Ffig"*"^" [_C Vt' TH" '*''j' T h ' S A '7J-* S V J J L C v L C a H 

S? 8 a£gcVg£g1ag^ 3600 

Frame2 F F "VMS N L Q w o c L T R H K K S 
Framel ,^^1^^^ 3660 

Frame2 h l R p S c i c l y a s t s k c t l 
CSr 1 aJac^tgaggcct^cat cta-^tgtctctactgatagcatccacaa gtaaat gtacatta 
3720 

ESS""*"? i L a' 'l'o* T/'l'**'** n T s H ** i* 

K?* rlLSnScAiA^^ 3780 

Frame2 A I F A E L D G I L ^IC K p m L E V T T C 
™ X TGCAATCTXrGCAGAGCTC^CGGGATTCTTAAAAAAGAAATCTGG 3840 



............ "■"*•' *i" "i* " t" " p" *P* "t" m aggnknyr 

Frames C F * C I * N I F P T M a * r l q 

Frame2 L L V H l k h t- » t i ™ £ T K x T D 

ST 61 gctt^agtg^tctga^a^tttttcctacaatgg^g J'^^amcaaaaa^acagac 3900 

■p"c"H*"s "m" i" i V Q R 



ks " s l v- o t -vvvvvvv vvvvv v / v." 

sr 1 A?c4 re | M ^^i™ s ^^^^ 3960 

Frame2 N l k k y n l r => v ■ =» w A G w R 

SET 1 caVt^gLga^at^^ G6a6g 4020 

M hcvl6030307 M Lucy, AA Boleth 
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FIGURE 1G 

EMLIKIV V 



Frames * L S F JC R L 5 N T I L # E Ji ^ f Q * G 
Boleth 



B'B\y'-JjJ J C W T S T 1 x 

V L L A L Y 



Frame3 A 1 C A w 6 P H F L R E g I l g Q w e l 

■■•""•••*q"v'"l Q * _A _S J- J 



Frame3'"*'*F"L S V V T N J* J * Q v ^ A « y g j t y R 

FSii3"s"T" TVrn' 't / F L S V L C L S q" S V *" N 1 «* >- A 

; mm ;"C,"*"i"o"i YPFFPFHC 

Frame3 L S I I 1 > L V / / v p F F S F S L L 

gST 61 A^GCCl^TrlACATrrCTAAATCCCATCrTC 



Fr»a"' i "'j""" ; "6''i';n'' i " T V L L ^ y T ^ jS ^ ^ ^ y 
Framel H s L T c G ^pf v n ArTTAGAACT cagaggattgcattttt 



EST 1 cat.!^^^^^.^ 

agggcaaagctatt 

"* * K " I M G 



Frame2 * A v F # E * l Q ■ c f l # R e « . - ^ - 
SET 1 TTGAGCATATTTT^GTAACTGWGTGCTTTCTTAGGGAM 4740 



■■• "r r flKSLS •KIN*' 

FPame3 F L L A L G A F 6 C A K j p j |_ £ m K W * 

DNA TTCTGTTGGCnTTGGGGGC^TTTGGGrrGCGCTA^TCri * M-^ !,^^ ol eth aa LU cy 



ES- iw^^^ 4860 
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FIGURE 1H 

ESS 5 X* T L A L H 1 K E V* - 1 
ES" 1 CAATATCACTGCtIaTTAG^^^ 

GG Boleth SNPG4919 

ESr 1 CA<nWGAATGOT^TTATTAAAAAGTC^GAAACAATA6ATGCT6CTGASGCTOTGtoG^98^ 
Boleth SNPG4973 

Htt''i''i''*'r'i''L''H^ ' v w E K q 

Frame2 N R N T F T L L L G M »| * F N H C G R 

^A Mel aa^taggaacaVtttt^^^ 5040 

Frame2 S V A I P Q R S ^R T R N y t x t l w v S 
5BT 1 CAGTCTGGCCATTCCT^GAGATCTAGAACCAGAAA^ 

tcg Boleth c/T Lucy, c/c Boleth 
SNP3G5063> CCT Lucy 6cr BQleth 

CCC Lucy <SNP3G5088 



mSS«Z P K 6 I * I I L L * R H M H Y J J 

5T" 1 ACCCAAAGGAATA^ 5160 

Frame2 T x Y N s k d l fc r 1 11 * * M D 

ST* CACTATTTAC^^ 5220 

Fra^3"i"R "k"c" 'g V 'V "h"h"« "i" K K 6*" M * S V S 

Frame2 K K M W C I P P W N s T Q p E ^ » " E 3 F 

ST* AAAGAAAAT^GG^ 5280 

sir* aCi^g^g^ 5340 

Frame2 N R K Q S T S C S H 5 s E S * t M T T 

gaacagaaaaL^gcacctcatchtctIatVcctaagt GAGAGTTGAACAATGACAACA 5400 

-gIaCACAGG^^ S4S0 
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FIGURE II 



rr«3 . „r s t • . » . , • 0 v « A « „t t » » ° V A 

i R G W V N R c S K P p w r 
DNa"" 6GACGAGAACTTAGA6GATGG6TCAATAGGTGCAGCAAACCACCATG6C 



Framel G .R^^^^^^^caaaccACCATGGC AGACTATACGC_55H> 

| gg Lucy, GGBoleth 
Lucy TT Boleth 

Frame2 C N K P A G S A h v » w « v K * N K 

^ Mel ATGTAACAAACCTGCAGGTTCTG^W 5580 
™3"S 'k"'l"'k"k"e "S"P'"C "L**f "C / h 1 C A 5 A T L X 

F rame2 Q x K K R K p m s y m y a y v g H ^ r y 
^f el gc^aatt]Lxaa^aa6aa^gccc5tgtctta ^TCTATGCATATGTT^TTGCAGCACTAT 5640 

yr^3^-'r-x'k'jjy^''^'/7^'/' x s N M G v R D L w d 1 k k e 

Frame2 H N S K D M E S T M 5 x w # Q « R 

TCAC^GCA^G^^ 5700 

f^3"k'V'w-;q *x- m c-'s-;t v Q v777';v L s T Vs v s 

Frame2 N VAN M L Y R Q d J- « G * M s H R K F 
SST el AaVctGG^TATG^ 5760 

Frame2 p L P d M G p S a s t i- « s R H R 

ST 1 tIcactccc^gacatg^^^ 5820 

Frames W E G E E L P R G M / H P S R s T S Q P s 
FrameZ G R " c c D k n d p s k P v h L T A F 
SSr 1 GGGAAGGTGAA^GCTTCCCTAAGGATGACC^TCCAAGCCG 5880 

h-r"a*>-;;s_r_q g a q r d i p q r 

^ mfil CTGGGATACAAGGCTGGCATGACCCAC^ 5940 

!SS tVrV P « g A " x L 
E el ctgaacaagaaggaggtggtgg^ 6000 

Frame2 w a A w k P l k a s e l a e h 

ST 1 6060 

F rame2 S v M N A R G V s I R t g g r r k a 

ATCAGTGATG^TGCAAGAGGCGTrTCTATAAGAACTGGCATAAAT ^^GA^GA^GGCC 6120 

^^'■^^vrrrH;^^ 1 ;^*;^ S A S A W G R E R G T 

Sr tU^^^^ 6180 



Fr^3-py D i-;a' R G' W H # D / T « S R C A 6 G K S S R T Q C 
Frame2 W D T l/ R A L r A MT H I v R E V D R 
Framel L G Y K A ^ M " A ± rmr -^ AA r T r/:ArAGC 
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FIGURE U 



FfiiS'-C ;h"a T -R L / W G £ " * R E S V G C S P S T S P 

«a 6780 
F 1 6960 
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FIGURE IK 



rM .3 A T GG X S IK V ! F B * K K K K * £ K « 
Frame2 N W w D L _ N K Y F p ■- K K R K R K C 
g2T l CMCTGGTGGG^TCTC/^TAAAAGTTATTTT^ 7020 

zm* m «"b *i v 't -t"b v ;jtv rn v v #E o i h m yS lF c 

53T 1 cSggaacatggatS^^^ 7140 

Frame2 Q L P H V L T y =• ^ ft E H MD T 

EST 1 ccaa^accacatgVtctcacttataagtagg^ 7200 

Friia- t- ■rrrn'lv" G %V* k ' G ° *V * w 6 E G E * G * E 

Frame2 R R E Q H T l g s i « G R R E R 

set 1 &£a£f^ 
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FIGURE 2 



SEQ ID NO-3 17 KKKTHPHQKVGKGYKQTLLRGRHLRGQETXEKKLTH — VXET T ggSo^W«"» 

SEQ ID NO: 4 34 KSiVHSALKIFKTAEECRLDRDEERAYVLYMKyVTVlNL 

SEQ ID NC.3 X34 ^ 

« ID N0:4 154 " 

ID NO -3 194 CIX.HSLSVPX^ISPXV*^^ 253 

id r. l s^^-— m 273 



SEQ 
SEQ 



-KGGYKNWFLCYSQYTTNaKVTPPPQHQNEELSISLD 301 
SE Q ID NO:3 2S4 LWHLKDXLFKHE +GCW+NW LCX QYTTNAKVTPPP+ ,„ 

SEQ ID NO:4 2 74 ^D^^^ ^ 

SE Q ID UO-.3 30 2 T. 7 
Z ID NO-4 334 JSSSSSA^^SS& 3" 



SEQ ID UO:3 479 E 479 
E 

SEQ ID NO: 4 508 E 508 
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FIGURE 3 



rhodanese 
SEQ ID NO: 3 
rhodanese 
SEQ ID NO: 3 
rhodanese 
SEQ ID NO: 3 



* ->tagelkalles .apkliliDvRspefGeryeyegGHIpgAvNvplee 
ta+el+ ++ +++ + + + y+ ++I ++ vp 
164 TAKELYTMMMDkNISLIIMXAQRM QXYQDSCILHSLSVP-XX 204 

eiealldrsgilpdieklhllkdpeelaklfgelgsskdkrvivycrsgr 
+i+ +++s+i++ hl+ d ++k+ g+ + + + +s + 
205 AISPXVTASWIEA HLPYDSIXTWKKRGNVEY MVLLDWFSSAK 246 

dllrnrrsalaalllkklgypeVyiLkGGykeWlak< -* 
+ + + ++lk + kGGyk+W+ + 

247 DLQIXTTLWHLKDXL FKWEKGGYKNWFLC 275 



SEQUENCE LISTING 



<110> Applied Research Systems ARS Holding N.V. 
5 <120> NOVEL UBP8rp POLYPEPTIDES AND THEIR USE IN THE TREATMENT OF PSORIASIS 



<130> 886 EP 

10 <160> 51 

<170> Patentln version 3.1 

<210> 1 

15 <211> 7260 

<212> DNA 

<213> Homo sapiens 

<220> 

20 <221> exon 

<222> (851) . . (1017) 

<223> exon 1 



25 <220> 

<221> Intron 

<222> (1018) . - (1046) 

<223> 

30 <220> 

<221> exon 

<222> (1047) . . (1675) 

<223> exon 2 

35 

<220> 

<221> Intron 

<222> (1676) . . (1718) 

<223> 

40 

<220> 

<221> exon 

<222> (1719) . . (2371) 

<223> exon 3 

45 



<220> 

<221> allele 

50 <222> (1199) . . (1199) 

<223> Biallelic marker No. 1 



<220> 

<221> allele 

<222> (1262).. (1262) 

5 <223> Biallelic marker No. 



<220> 

<221> allele 

10 <222> (1426) . . (1426) 

<223> Biallelic marker No. 



<220> 

15 <221> allele 

<222> (1444) . . (1444) 

<223> Biallelic marker No. 



20 <220> 

<221> allele 

<222> (1487) (1487) 

<223> Biallelic marker No. 5 



25 

<220> 

<221> allele 

<222> (1490).. (1490) 

<223> Biallelic marker No. 6 



30 



<220> 
<221> allele 



<222> (1505) .. (1505) 
35 <223> Biallelic marker No. 7 



<220> 

<221> allele 

40 <222> (1518) .. (1518) 

<223> Biallelic marker No. 



<220> 

45 <221> allele 

<222> (1554) .. (1554) 

<223> Biallelic marker No. 



50 <220> 

<221> allele 

<222> (1630) .. (1630) 



<223> Biallelic marker No. 10 



<220> 

5 <221> allele 

<222> (1638) . . (1638) 

<223> Biallelic marker No.. 11 



10 <220> 

<221> allele 

<222> (1680) . . (1680) 

<223> Biallelic marker No. 12 



<220> 

<221> allele 

<222> (1895) . . (1895) 

<223> Biallelic marker No. 13 



<220> 

<221> allele 

<222> (2180) . . (2180) 

25 <223> Biallelic marker No. 14 



<220> 

<221> allele 

30 <222> (2449) .. (2449) 

<223> Biallelic marker No. 15 



<220> 

35 <221> allele 

<222> (2721) . . (2721) 

<223> Biallelic marker No. 16 



40 <220> 

<221> allele 

<222> (3127) . . (3127) 

<223> Biallelic marker No. 17 



<220> 

<221> allele 

<222> (3137) (3137) 

<223> Biallelic marker No. 18 



<220> 



<221> allele 

<222> (3138).. (3138) 

<223> Biallelic marker No. 19 



<220> 

<221> allele 

<222> (3183) .. (3183) 

<223> Biallelic marker No. 



<220> 

<221> allele 

<222> (3222).. (3222) 

15 <223> Biallelic marker No. 21 



<220> 

<221> allele 

20 <222> (3269).. (3269) 

<223> Biallelic marker No. 



<220> 

25 <221> allele 

<222> (3445) .. (3445) 

<223> Biallelic marker No. 23 



30 <220> 

<221> allele 

<222> (3470) .. (3470) 

<223> Biallelic marker No. 24 



<220> 

<221> allele 

<222> (3915) . . (3915) 

<223> Biallelic marker No. 25 



<220> 

<221> allele 

<222> (3973).. (3973) 

45 <223> Biallelic marker No. 26 



<220> 

<221> allele 

50 <222> (4254) .. (4254) 

<223> Biallelic marker No. 27 



<220> 

<221> allele 

<222> (4472) . . (4472) 

5 <223> Biallelic marker No. 28 



<220> 

<221> allele 

10 <222> (4660) .. (4660) 

<223> Biallelic marker No. 29 



<220> 

15 <221> allele 

<222> (4770) . . (4770) 

<223> Biallelic marker No. 30 



20 <220> 

<221> allele 

<222> (4919) . . (4919) 

<223> Biallelic marker No. 31 



<220> 

<221> allele 

<222> (4973) . . (4973) 

<223> Biallelic marker No. 32 



<220> 

<221> allele 

<222> (5063) . . (5063) 

35 <223> Biallelic marker No. 33 



<220> 

<221> allele 

40 <222> (5065) .. (5065) 

<223> Biallelic marker No. 34 



<220> 

45 <221> allele 

<222> (5079) .. (5079) 

<223> Biallelic marker No. 35 



50 <220> 

<221> allele 

<222> (5080) . . (5080) 



<223> Biallelic marker No. 36 



<220> 

<221> allele 

<222> (5088) (5088) 

<223> Biallelic marker No. 37 



10 <220> 

<221> allele 

<222> (5090) (5090) 

<223> Biallelic marker No. 38 



15 



20 



25 



<220> 

<221> allele 

<222> (5407) (5407) 

<223> Biallelic marker No. 39 



<220> 

<221> allele 

<222> (5466) (5466) 

<223> Biallelic marker No. 40 



<220> 

<221> allele 

30 <222> (5520) (5520) 

<223> Biallelic marker No. 41 



35 



^tatgU tcttttgttt ggtctctccc ttagcataac gatgtttgag atgatgccat 
tcattcattt ttgttgctga gcagctgccg agtattgttg gaatcccagt ttattcattg 
gtttctgtgt ctccagttga tagacatgtg gattcctcca gttagggttt gttattaatg 



120 
180 



aagccactat aaataactgc ttacaagtgt ggacttacat ttttatttct tttggataaa 
tacgtatttg tggaattgct gggccatgtg gtaatagatg ggtaactgta taagaaactg 
45 ccataccact ttacaaattg gctgccacat tttttgcatt cctaccagca atatcagaca 
ttcctatttt ttccatattc ttgccagtgt: taagacttat catatgtctt tttaacttta 
tctgctctag gtgatgtgtg atggtttctc attgtggttt taacttgcac ttctttgatg 
50 actagtattg tttgctatct tttcatgttc atctaagcga cttattacat atattttatg 540 



300 
360 
420 
480 



10 



aactattttg caaattcaat gattaattcc agagactttt tcagaattcc ctagtgtttt 600 

ctacatatac aatgaagttg gtgacaaaga aagactttca tttcttcctt tcttatccat 660 

tgatcttttt tcttttaaaa ttattattat ttggtagaga tgaggtctca cttatcaggc 720 

tggtctcaaa ctcctgatct caagtgatcc tcccacctca gcctcccaaa atgcagggat 780 

tacaggcatg agccaccatg cctggtcctt gttgcactgg ttaggatgac tgttaggtgt 840 

ttaaacaaga atg atg aga get cac atg ttt gtt tac aag gaa ctt aaa 889 
Met Met Arg Ala His Met Phe Val Tyr Lys Glu Leu Lys 
15 10 

15 caa att tac aag aaa aaa acc cat ccc cat caa aaa gtg ggc aaa gga 937 
Gin lie Tyr Lys Lys Lys Thr His Pro His Gin Lys Val Gly Lys Gly 
15 20 25 

tat aaa cag aca ctt etc aga gga aga cat tta cgt ggc caa gaa aca 985 
20 Tyr Lys Gin Thr Leu Leu Arg Gly Arg His Leu Arg Gly Gin* Glu Thr 
30 "* 35 40 45 

tat gaa aaa aag etc aca cac gta tat gaa ac gtgactgttt ataatcctat 1037 
Tyr Glu Lys Lys Leu Thr His Val Tyr Glu Thr 
25 50 55 

ccaaaaaag a act gat ttc aag caa cag cag tat tac ttc cat tea ata 1086 
Thr Asp Phe Lys Gin Gin Gin Tyr Tyr Phe His Ser lie 
60 65 

30 

ctt gga cct gca aac ate aaa aaa gee act gga gaa act gaa cga etc 1134 
Leu Gly Pro Ala Asn lie Lys Lys Ala Thr Gly Glu Thr Glu Arg Leu 
70 ~ 75 80 85 

35 tct gaa age ctt aaa eta aga tat gaa gaa gtt gaa ate tgg aaa aaa 1182 
Ser Glu Ser Leu Lys Leu Arg Tyr Glu Glu Val Glu lie Trp Lys Lys 
90 95 100 

ctt gag gaa aag gac arg cag ggg gaa gca cag tgg eta caa caa aaa 1230 
40 Leu Glu Glu Lys Asp Xaa Gin Gly Glu Ala Gin Trp Leu Gin Gin Lys 
105 110 115 

agg cag gaa aca gga aga gag gat ggc age ayg ttg get aaa ggt tct 1278 
Arg Gin Glu Thr Gly Arg Glu Asp Gly Ser Xaa Leu Ala Lys Gly Ser 
45 " 120 125 130 

ttg gag att gta ttg gat tec aaa gac aaa acc caa aag age aat ggt 1326 
Leu Glu lie Val Leu Asp Ser Lys Asp Lys Thr Gin Lys Ser Asn Gly 
135 140 145 



50 



gaa aag aat gaa aaa tgt gag acc aaa gag aaa gga gca ate aca gca 1374 
Glu Lys Asn Glu Lys Cys Glu Thr Lys Glu Lys Gly Ala lie Thr Ala 



155 160 165 

150 155 



S E £ £ £ £ £ 2 S i 2 S E 2 S S 

5 170 175 

at, sat act caa a 9 a at ? eg *at tat caa oat tec tot att tta cat 
Met Xaa Ala Gin Arg Met Gin Xaa Tyr bin as P * 
185 l"U 

10 £ £ £ IS £ E £ K £ £ £ £ g 2 - Si 

200 205 

„_ „+• tct ata gay aca tgg aag aag 

15 £ £ £ E £ £ £ £ £ S n. £ - ^ w 

215 220 

20 s s £ £ E £ £ £ £ £ S S S 2 = S 

230 235 

£ £ £ E £ £ £ £ S I = = 2 - £ 2 ° 



25 



£ S £ £ £ £ 3 £ £ 1 2 £ 2 2 2 £ 

270 275 

- £ £ £ £ £ £ £ £ 2 E 2 £ £ 1 2 £ 

285 290 

40 £ £ £ £ £ £ £ £ £ £ £ 2 i 2 £ £ 

300 305 

£ £ £ £ E £ £ £ £ 2 2 £ 2 £ E £ 

45 315 320 

£ £ £ £ £ £ E £ £ S 2 £ £ £ £ £ 

330 335 

4- o=.-h tea att cca gtt gaa tea gtt get get tct 
aca gga cca ctg aat ata tea att ce g g ^ ^ ^ ^ 

Thr Gly Pro Leu Asn ixe £>er xxe jtj-u ~ 



1614 



1662 



1718 



ttc aag tgg gaa a gtaaractgt cctgtgcaat gggccttggg cctttggttt tag 

Phe Lys Trp Glu 
265 

30 ^-4- +„r* tat tec caa tat aca aca 1765 



1813 



1861 



1909 



1957 



350 



355 



360 



aaa tct gat gtt tea ccc ate att cag cca gtg cct age ata aag aat 2053 
Lys Ser Asp Val Ser Pro lie lie Gin Pro Val Pro Ser lie Lys Asn 
365 370 375 

gtt cca cag att gat cat act aaa aaa ctg gca gtc aaa ttg cct gaa 2101 
Val Pro Gin lie Asp His Thr Lys Lys Leu Ala Val Lys Leu Pro Glu 
380 385 390 

gag cat ata ate aaa tct gaa agt aca aat cat gag caa cag tct cct 2149 
Glu His lie lie Lys Ser Glu Ser Thr Asn His Glu Gin Gin Ser Pro 
395 400 405 

15 cag aat gaa aaa gtt att cct gat tgt tec rcc aag cca gta gtt tec 2197 
Gin Asn Glu Lys Val lie Pro Asp Cys Ser Xaa Lys Pro Val Val Ser 
410 415 420 425 



10 



tct cca act etc atg tta aca gat gaa gaa aag get cat att cat gca 
20 Ser Pro Thr Leu Met Leu Thr Asp Glu Glu Lys Ala His lie His Ala 

430 435 440 



30 



35 



40 



45 



50 



2245 



gaa act get ctt eta atg gag aaa aac aaa caa gaa aaa gaa ctt cag 2293 
Glu Thr Ala Leu Leu Met Glu Lys Asn Lys Gin Glu Lys Glu Leu Gin 
25 445 450 455 

gaa aga cag caa ggg aaa cag aaa gaa act gag gag gga aga aca cga 2341 
Glu Arg Gin Gin Gly Lys Gin Lys Glu Thr Glu Glu Gly Arg Thr Arg 
460 465 470 



gca aaa age caa aaa gaa aca aga age tga agaaaatgaa attacacaga 2391 
Ala Lys Ser Gin Lys Glu Thr Arg Ser 



475 




480 










agcaacaaaa 


agcaaaagaa 


gaaatggaga 


agaaagaacg 


tgaacaggee 


aagaaagwgg 


2451 


ataaagaaat 


ctcagcaaag 


aagggcaaag 


aaataacaag 


agtaaaaaga 


caaagtaaaa 


2511 


gtgatcatga 


aacctctggt 


gecgagaagt 


ctgtagagga 


cagggggaga 


agatgttcaa 


2571 


ccccagaagt 


acagaaaaag 


tcaacaagag 


atgtgtccca 


tacatctgeg 


acaggggatt 


2631 


caggttcagg 


caagectttt 


aagattaaag 


gacaaccaga 


aactggaatt 


ctaaggacag 


2691 


aaacttttag 


agaggataca 


gatgatactk 


aaagaaataa 


aactcaacga 


gaaccttcga 


2751 


taatagcacg 


aagtgaagaa 


atggggagga 


tggtaccagg 


actgccttca 


ggctgggcca 


2811 


agtttcttga 


tccaatcact 


ggaacgtttc 


attattatca 


ttcaccacta 


acactgttca 


2871 


tatgtaccca 


ctggaaatgg 


ctccttcatc 


tgcacctcct 


tccacccetc 


caactcataa 


2931 



10 



15 



20 



25 



30 



35 



40 



45 



50 



aggcaagcca cagattcctg ctaagcagga tagggaacct tccaaactga aatgctctta 
ctcctcccca gatataaccc aggctattca agaggaagcc agcagtaact ccaacagtta 
atcaggaaga caagccaaca tgctacccta aagctgagat ctcaaggctt tctgcttctc 
agatttggaa actcartcct gttttyrgag gttctggacc agctcttact ggacttcgta 
acttaggaaa trcttgttat atgaactcaa tattgcagtg cctatgtaat sctccacatt 
tggctgatta tttcaaccga aactgttatc aggatgayat taacaagtca aatttgttag 
gggcataaag gtgaagtggc agaagaattt ggtataatca tgaaagcccc gtggacagga 
cagtatagat atatcagtcc aaaagacctt aaagtcacca ttgggaagat caattaccag 
tttgcaggat acagtcaaga ttcacaagaa tttyttctgt tcctaatgga tggtctccrt 
gaagatctaa ataaaactga taatcggaag acatataaag aagaaaataa tgatcatctc 
aatgacttta aagctgcaga acatgcctgg cagaaacaca agcggctcta tgagtctatt 
attgttgcac tttttcaggg tcaattcaaa tctacagtac agtgcctcac ccgtcacaaa 
aagtctagga cacttgaggc cttcatgtat ttgtctctac tgatagcatc cacaagtaaa 
tgtacattat aggattgcct tagattattt tctaaagaag aaaaactcat agataataac 
agattttact gcaatctttg cagagctcga cgggattctt aaaaaagaaa tctggaagtt 
accacctgtg cttttagtgc atctgaaaca tttttcctac aatggcaggt ggaaacaaaa 
attacagaca tctgtggact tccygttaga aaatcttgcc ttgtcacagt atgttattgg 
tccaaagaac aatttgaaga amtataattt gttttctgtt tcagatcact gcggtgggct 
ggatggaggc cattacacag cctactgtaa aaatgcagca aaacagcggt ggtttaagtt 
tgatgatcat gaagtttctg atatctctgt ttcttctgtg aaatcttcag cagcttatat 
cctcttttat acttctttgg gaccatgagt aactgatgta ggcacataag gagacatagg 
ttataaacta gttatctttt aaaaggctca gcaacacaat tcttgaaatg cttatcaaga 
tartggtagc aatagctggc catttagagg aattctagga cagtgggagc tgtgttacta 
gcactatata attcctgtca gtggtgacaa ataacactta acaagtattg cagtaagcat 
cacttacagg taccatttat ttcaaaacaa cttttttagt ctgctccaaa gttaaaataa 
ttaactagct aagcattatt attctactgg tctaaaaacc wttgtaccct ttttttcctt 



2991 
3051 
3111 
3171 
3231 
3291 
3351 
3411 
3471 
3531 
3591 
3651 
3711 
3771 
3831 
3891 
3951 
4011 
4071 
4131 
4191 
4251 
4311 
4371 
4431 
4491 



10 



20 



ttcactgtta cagccttttc acatttctaa atcccatctt catatactat gaatactcta 4551 

gaatgatgtg aagcagatag gaatgtatgt gtacatattt attgcatact tacacatcaa 4611 

atcgatatac atagtttaac atgtggtcct ttcgtgaaac ttagaactya gaggattgca 4671 

tttttttctt tgagcatatt ttgagtaact gcagtgcttt cttagggaaa tgacagggca 4731 

aagctatttt tctgttggct ttgggggcat ttgggtgcrc taaatcttta tcttaaaaaa 4791 

taaatggaaa cttcctttaa ttttttaaaa tgagacatta aaatcttaat gagaaaaatt 4851 

taaaaagctc aatatcactg ctcattagag aaatgtaaat caaagccaca atgagatacc 4911 

15 atctcccrcc agtcagaatg gtaattatta aaaagtcaag aaacaataga tgctggtgag 4971 

gytgtggaga aataggaaca cttttacact gttgttggga atgtaaacta gttcaaccat 5031 

tgtggaagac agtgtggcca ttcctcagag ayckagaacc agaaatayya tttgacscyt 5091 

tgggtatcta cccaaaggaa tataaatcat tctactataa agacacatgc acacgtatgt 5151 

ttactgcagc actatttaca atagcaaaga cttggaacca acccaaatgt ccatcagtga 5211 

25 tagatggata aagaaaatgt ggtgcatacc accatggaat agtacacagc cagaaaaagg 5271 

aatgagttca tgtcctttgc agggacatgg atgaagctgg aagtcatcat cctcagcaaa 5331 

ctaacacggg aacagaaaac aaagcacctc atgttctcat tcctaagtga gagttgaaca 5391 

atgacaacac atggayacag ggaggggaac aacacatatc agggcctttt ggggagtgtg 5451 

gggggcaagg gacgrgaact tagaggatgg gtcaataggt gcagcaaacc accatggcag 5511 

35 actatacgya tgtaacaaac ctgcaggttc tgcacatgta tcctggaacc taaagtaaaa 5571 

taaaacaaag caaattaaaa aaagaaagcc catgtcttac atgtatgcat atgttcattg 5631 

cagcactatt cacaatagca aagacatgga atcaacctaa atgtccatca atggtagact 5691 

ggataaagaa aatgtggcaa atatgctcta ccggcaggat ttgatggcgt gatgtctcac 5751 

agaaagttct ccactcccag acatgggtcc ctcggcttcc tgccttggaa gcacagcagc 5811 

45 aggcatcgtg ggaaggtgaa gagcttccct aaggatgacc catccaagcc ggtccacctc 5871 

acagccttcc tgggatacaa ggctggcatg acccacatcg tgcgggaagt cgacaggcca 5931 

ggatccaagg tgaacaagaa ggaggtggtg gaggctgtga ccattgtgga gaggccacca 5991 

gtgggcattg tgggctgcgt ggaaacccct caaggcttcc ggacttgcaa gactgtcttc 6051 



30 



40 



50 



gctgagcaca tcagtgatga atgcaagagg cgtttctata agaactggca taaatctaag 6111 

aagaaggcct ttaccaagta ctgcaagaaa tggcaggatg aggatggcaa gaagcagctg 6171 

5 gagaaggact tcagcagcat gaagaagtac tgccaagtca tctgcgtcat tgcccacacc 6231 

cagatgcacc tgcttcctct gtgccagaag aaggcccacc tgatggagat ccaggtgaat 6291 

ggaggcactg tggctgagaa gctggactgg gctggcgaga ggctcaagca ccaggtacct 6351 

10 gtgaaccaag tgtttgggca ggatgagatg atcgacgtca tcagggtgac caagggcaaa 6411 

ggctacaaaa gggtcaccag tcattggcac accaagaagc tgccccgcaa gacccaccaa 6471 

15 ggcctgtgca aggtggcctg tattggggca tggcatcctg ctcgtgtggg cttctctgtg 6531 

gtacgtggtg ggcagaaagg ctaccatcac cgcactgaga tcaacaagat ctataggatt 6591 

ggctagggct accttatcaa ggatggcaag ctgatcaaga acaatgcctc cactgactat 6651 

20 gacctgtctg acaagagcat caaccctttg ggtggcttcg tccactatgg tgaagtgacc 6711 

aatgactttg tcatgctgaa aggctgtgtg gtgggaacca agaagtgggt gctcaccctc 6771 

25 cacaagtcct tgctggtgca gacaaagcag cgggctctgg agaagattga ccttaagttc 6831 

attgacacct cctccaagtt tggccatggc cgcttccaga ccacggagga gaagaaagca 6891 

ttcatgggac cactcaagaa agaccgaatt gcaaaggaag aaggagctta atgctgggaa 6951 

30 cagatattgc aactggtggg atctcaataa aagttatttt ccattaaaaa aaaaagaaaa 7011 

agaaaatgtg gcacatatac accacagaat accatgcagc cataaaaaag aatgagatca 7071 

35 tgtcctttgc aggaacatgg atggagttgg aggccattat ccttagcaaa ctgaggcagg 7131 

aacagaaaac caattaccac atgttctcac ttataagtag gagttatatg atgagaacac 7191 

atggacacac agaagggaac aacacacact ggggtccact tgagggtaga gggtgggagg 7251 

40 7260 
agggagagg 



<210> 2 

45 <211> 1449 

<212> DNA 

<213> Homo sapiens 

<220> 

50 <221> CDS 

<222> (1)..(1449) 
<223> 



<220> 

<221> misc_feature 

<222> (1) (167) 

5 <223> exon 1 



<220> 

<221> misc__feature 

10 <222> (168) . . (796) 

<223> exon 2 



<220> 

15 <221> misc_feature 

<222> (797) . . (1449) 

<223> exon 3 



20 <400> 2 

atg atg aga get cac atg ttt gtt tac aag gaa ctt aaa caa att tac 
Met Met Arg Ala His Met Phe Val Tyr Lys Glu Leu Lys Gin He Tyr 
1 5 10 15 

25 aag aaa aaa acc cat ccc cat caa aaa gtg ggc aaa gga tat aaa cag 
Lys Lys Lys Thr His Pro His Gin Lys Val Gly Lys Gly Tyr Lys Gin 
20 25 30 



aag etc aca cac gta tat gaa aca act gat ttc aag caa cag cag tat 
Lys Leu Thr His Val Tyr Glu Thr Thr Asp Phe Lys Gin Gin Gin Tyr 
35 50 55 60 



40 



gaa act gaa cga etc tct gaa age ctt aaa eta aga tat gaa gaa gtt 
Glu Thr Glu Arg Leu Ser Glu Ser Leu Lys Leu Arg Tyr Glu Glu Val 
85 90 95 

45 gaa ate tgg aaa aaa ctt gag gaa aag gac arg cag ggg gaa gca cag 
Glu He Trp Lys Lys Leu Glu Glu Lys Asp Xaa Gin Gly Glu Ala Gin 
100 105 110 

tgg eta caa caa aaa agg cag gaa aca gga aga gag gat ggc age ayg 
50 Trp Leu Gin Gin Lys Arg Gin Glu Thr Gly Arg Glu Asp Gly Ser Xaa 
115 * 120 125 



48 



96 



aca ctt etc aga gga aga cat tta cgt ggc caa gaa aca tat gaa aaa 144 
30 Thr Leu Leu Arg Gly Arg His Leu Arg Gly Gin Glu Thr Tyr Glu Lys 
35 40 45 



192 



tac ttc cat tea ata ctt gga cct gca aac ate aaa aaa gec act gga 240 
Tyr Phe His Ser He Leu Gly Pro Ala Asn He Lys Lys Ala Thr Gly 
65 70 75 80 



288 



336 



384 



290 295 

45 cc= «a «, r |» gj -* |« £ »» ?S E E E 22 E 
Pro Ser Leu Glu Glu Ser lie Pro &er uy& 32Q 

305 310 

" c - r £ SS S3 SS E s s is 2 S IS £ e 

50 Pro Pro Pro He Xaa vax Asp 335 



480 



528 



576 



624 



2 s S Sg 2 2 s s e 2 s s £ £ £ = 

130 135 

; caa aag age aat ggt gaa aag aat gaa aaa tgt gag acc aaa gag aaa 

Gin Lys Ser Asn Gly Glu Lys Asn Glu Lys Cys Bin my 
145 150 I 55 

. S E S £ S 2 E 2 S i 2 S = - i = 

165 

S S 2 £ 2 £ E E £ 2? £ £ £ 2 £ 

5 180 185 

2 S £ £ E £ £ £ « £ £ £ I s s = 

195 200 

" E e £ £ E 3 £ E £ £ £ £ S S 2 £ 

210 215 

* gay aca tg9 aa 9 jjj ajg |f aa t £, « £ «| 9g «J «t jj. 
Asp Thr Trp Lys Lys Arg Gly Asn vax ^xu y 24Q 
225 230 

tgg ttt agt tct gca aaa gat tta cag att rga aca acw etc tgg cat 
30 Trp Phe Ser Ser Ala Lys Asp Leu Gin lie Xaa Thr Xaa u rp 

245 250 

2 £ 5 E £ £ 2 5 E S £ S £ i £ 2 

35 4 260 265 

£ £ S £ £ 2 S £ £ £ £ 2 S S = ? " 

275 280 
40 4-+-~ +-,-t- ai-r tea tta gat ttt act tat 912 

£ E K £ £ E E 2 £ S £ -S Lp - - ^ 



720 



768 



816 



864 



960 



1008 



ata agt gat aat gat caa aat gag agg aca gga cca ctg aat ata tea 
lie Ser Asp Asn Asp Gin Asn Glu Arg Thr Gly Pro Leu Asn lie Ser 
340 345 350 



1056 



att cca gtt gaa tea gtt get get tct aaa tct gat gtt tea ccc ate 
lie Pro Val Glu Ser Val Ala Ala Ser Lys Ser Asp Val Ser Pro He 
355 360 365 



1104 



att cag cca gtg cct age ata aag aat gtt cca cag att gat cat act 
10 He Gin Pro Val Pro Ser He Lys Asn Val Pro Gin lie Asp His Thr 
370 375 380 



1152 



15 



aaa aaa ctg gca gtc aaa ttg cct gaa gag cat ata ate aaa tct gaa 
Lys Lys Leu Ala Val Lys Leu Pro Glu Glu His He He Lys Ser Glu 
385 390 395 400 



1200 



20 



agt aca aat cat gag caa cag tct cct cag aat gaa aaa gtt att cct 1248 
Ser Thr Asn His Glu Gin Gin Ser Pro Gin Asn Glu Lys Val He Pro 
405 410 415 

gat tgt tec rcc aag cca gta gtt tec tct cca act etc atg tta aca 1296 
Asp Cys Ser Xaa Lys Pro Val Val Ser Ser Pro Thr Leu Met Leu Thr 
420 425 430 



25 gat gaa gaa aag get cat att cat gca gaa act get ctt eta atg gag 
Asp Glu Glu Lys Ala His He His Ala Glu Thr Ala Leu Leu Met Glu 
435 440 445 



1344 



aaa aac aaa caa gaa aaa gaa ctt cag gaa aga cag caa ggg aaa cag 
30 Lys Asn Lys Gin Glu Lys Glu Leu Gin Glu Arg Gin Gin Gly Lys Gin 
450 ~ 455 460 



1392 



35 



aaa gaa act gag gag gga aga aca cga gca aaa age caa aaa gaa aca 
Lys Glu Thr Glu Glu Gly Arg Thr Arg Ala Lys Ser Gin Lys Glu Thr 
465 470 ' 475 480 



1440 



aga age tga 
Arg Ser 



1449 



40 



<210> 
<211> 
<212> 



3 

482 
PRT 



45 <213> Homo sapiens 



<220> 
<221> 
<222> 
50 <223> 



misc_feature 
(107) . . (107) 

The f Xaa' at location 107 stands for Arg, or Lys. 



<220> 



<221> misc_feature 

S£ £e"an location 12. stands for Th r, on Met. 



5 <220> 

<221> miscjfeature 



tllll inf^at location 183 stands for Asp, or His. 



10 <220> 

<221> misc_feature 



tllll ^cation 189 stands for Asp, on T y n. 



15 <220> 

<221> misc_feature 



t£l i„5i;a'°an location 203 stands £ or Gly, on Ola. 



20 <220> 

<221> misc_feature 



ih^iaa^a't location 204 stands for on Lys. 



25 <220> 

<221> misc_feature 



<^f> in.^ial'°at location 209 stands for Sly, on Val. 



30 <220> 

<221> miscJEeature 



S£ Jne^ia.'"* location 213 stands for San. 



35 <220> 

<221> misc__feature 



ilZl ihe^x; a 25 al: location 251 stands for Gly, or Arg. 



40 <220> 

<221> mis cofeature 



? 2 t£ S 3 '.x«" 3 an location 253 stands fen Thn. 



45 <220> 

<221> misc_feature 



<llll ihe 5 ^;l 32 at location 325 stands for Glu, or Lys. 



50 <220> 

<221> xnisc_feature 
<222> (420) . . (420) 



<223> The f Xaa f at location 420 stands for Ala, or Thr. 



<400> 3 

5 Met Met Arg Ala His Met Phe Val Tyr Lys Glu Leu Lys Gin He Tyr 
15 10 15 

Lys Lys Lys Thr His Pro His Gin Lys Val Gly Lys Gly Tyr Lys Gin 
10 " 20 25 30 

Thr Leu Leu Arg Gly Arg His Leu Arg Gly Gin Glu Thr Tyr Glu Lys 
35 40 45 

15 

Lys Leu Thr His Val Tyr Glu Thr Thr Asp Phe Lys Gin Gin Gin Tyr 
50 55 60 

20 

Tyr Phe His Ser He Leu Gly Pro Ala Asn He Lys Lys Ala Thr Gly 
65 70 75 80 

25 Glu Thr Glu Arg Leu Ser Glu Ser Leu Lys Leu Arg Tyr Glu Glu Val 

85 90 95 

Glu He Trp Lys Lys Leu Glu Glu Lys Asp Xaa Gin Gly Glu Ala Gin 
30 ~ 100 105 110 

Trp Leu Gin Gin Lys Arg Gin Glu Thr Gly Arg Glu Asp Gly Ser Xaa 
115 120 125 

35 

Leu Ala Lys Gly Ser Leu Glu He Val Leu Asp Ser Lys Asp Lys Thr 
130 ^ 135 140 

40 

Gin Lys Ser Asn Gly Glu Lys Asn Glu Lys Cys Glu Thr Lys Glu Lys 
145 150 155 160 

45 Gly Ala He Thr Ala Lys Glu Leu Tyr Thr Met Met Met Asp Lys Asn 

165 170 175 



He Ser Leu He He Met Xaa Ala Gin Arg Met Gin Xaa Tyr Gin Asp 
50 180 185 190 



Ser Cys He Leu His Ser Leu Ser Val Pro Xaa Xaa Ala He Ser Pro 



195 



5 Xaa val Thr Ala Xaa Trp He Glu Ala His Leu Pro Tyr Asp Ser He 
210 215 



ASP Thr Trp Lys Lys Arg Gly Asn Val Glu Tyr Met Val Leu Leu Asp 
225 230 

Trp Phe Ser Ser Ala Lys Asp Leu Gin He Xaa Thr Xaa Leu Trp His 



245 250 



15 

Leu Lys Asp Ala Leu Phe Lys Trp Glu Lys Gly Gly Tyr Lys Asn Trp 
260 265 



20 Phe Leu Cys Tyr Ser Gin Tyr Thr Thr Asn Ala Lys Val Thr Pro Pro 
275 280 

25 Pro Gin His Gin Asn Glu Glu Leu Ser He Ser Leu Asp Phe Thr Tyr 
290 295 



Pro Ser Leu Glu Glu Ser He Pro Ser Lys Pro Ala Ala Glu Met Pro 
30 305 



310 315 



Pro Pro Pro He Xaa Val Asp Glu Asp He Glu Leu He Ser Asp Gin 
325 - 1 ^ u 

35 



40 



lie Ser Asp Asn Asp Gin Asn Glu Arg Thr Gly Pro Leu Asn He Ser 
340 345 

lie Pro Val Glu Ser Val Ala Ala Ser Lys Ser Asp Val Ser Pro He 
355 



45 He Gin Pro Val Pro Ser He Lys Asn Val Pro Gin He Asp His Thr 
370 375 

Lys Lys Leu Ala Val Lys Leu Pro Glu Glu His He He Lys Ser Glu 
50 385 390 



Ser Thr Asn His Glu Gin Gin Ser Pro Gin Asn Glu Lys Val lie Pro 
405 410 415 



5 Asp Cys Ser Xaa Lys Pro Val Val Ser Ser Pro Thr Leu Met Leu Thr 
420 425 430 

Asp Glu Glu Lys Ala His lie His Ala Glu Thr Ala Leu Leu Met Glu 
10 435 " 440 445 

Lys Asn Lys Gin Glu Lys Glu Leu Gin Glu Arg Gin Gin Gly Lys Gin 
450 455 460 

15 

Lys Glu Thr Glu Glu Gly Arg Thr Arg Ala Lys Ser Gin Lys Glu Thr 
465 470 475 480 

20 

Arg Ser 



25 <210> 4 

<211> 1118 

<212> PRT 

<213> Homo sapiens 

30 <400> 4 

Met Pro Ala Val Ala Ser Val Pro Lys Glu Leu Tyr Leu Ser Ser Ser 
15 10 15 

35 

Leu Lys Asp Leu Asn Lys Lys Thr Glu Val Lys Pro Glu Lys lie Ser 
20 ~ 25 30 

40 Thr Lys Ser Tyr Val His Ser Ala Leu Lys He Phe Lys Thr Ala Glu 
35 " 40 45 

Glu Cys Arg Leu Asp Arg Asp Glu Glu Arg Ala Tyr Val Leu Tyr Met 
45 50 55 60 

Lys Tyr Val Thr Val Tyr Asn Leu He Lys Lys Arg Pro Asp Phe Lys 
65 " 70 75 80 

50 

Gin Gin Gin Asp Tyr Phe His Ser He Leu Gly Pro Gly Asn He Lys 



85 



90 



95 



Lys Ala val Glu Glu Ala Glu Arg Leu Ser Glu Ser Leu Lys Leu Arg 
5 100 105 

TY r Glu Glu Ala Glu Val Arg Lys Lys Leu Glu Glu Lys Asp Arg Gin 
115 120 

10 

Glu Glu Ala Gin Arg Leu Gin Gin Lys Arg Gin Glu Thr Gly Arg Glu 
130 135 

15 Asp Gly Gly Thr Leu Ala Lys Gly Ser Leu Glu Asn Val Leu Asp Ser 
145 150 

20 Lys Asp Lys Thr Gin Lys Ser Asn Gly Glu Lys Asn Glu Lys Cys Glu 
Thr Lys Glu Lys Gly Ala He Thr Ala Lys Glu Leu Tyr Thr Met Met 

25 



180 185 



30 



35 



Thr Asp Lys Asn He Ser Leu lie He Met Asp Ala Arg Arg Met Gin 
195 

Asp Tyr Gin Asp Ser Cys He Leu His Ser Leu Ser Val Pro Glu Glu 
210 215 

Ala He Ser Pro Gly Val Thr Ala Ser Trp He Glu Ala His Leu Pro 
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40 Asp Asp Ser Lys Asp Thr Trp Lys Lys Arg Gly Asn Val Glu Tyr Val 
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Val Leu Leu Asp Trp Phe Ser Ser Ala Lys Asp Leu Gin He Gly Thr 
45 260 265 



Thr Leu Arg Ser Leu Lys Asp Ala Leu Phe Lys Trp Glu Ser Lys Thr 
275 280 
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val Leu Arg Asn Glu Pro Leu Val Leu Glu Gly Gly Tyr Glu Asn Trp 
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Leu Leu Cys Tyr Pro Gin Tyr Thr Thr Asn Ala Lys Val Thr Pro Pro 
5 305 310 315 320 
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Pro Arg Arg Gin Asn Glu Glu Val Ser lie Ser Leu Asp Phe Thr Tyr 
325 330 335 

Pro Ser Leu Glu Glu Ser He Pro Ser Lys Pro Ala Ala Gin Thr Pro 
340 345 350 
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Pro Ala Ser He Glu Val Asp Glu Asn lie Glu Leu He Ser Gly Gin 
355 360 365 

20 Asn Glu Arg Met Gly Pro Leu Asn lie Ser Thr Pro Val Glu Pro Val 
370 375 380 

Ala Ala Ser Lys Ser Asp Val Ser Pro lie He Gin Pro Val Pro Ser 
25 385 390 395 400 
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He Lys Asn Val Pro Gin He Asp Arg Thr Lys Lys Pro Ala Val Lys 
405 410 415 



Leu Pro Glu Glu His Arg He Lys Ser Glu Ser Thr Asn His Glu Gin 
420 425 430 
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Gin Ser Pro Gin Ser Gly Lys Val He Pro Asp Arg Ser Thr Lys Pro 
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40 Val Val Phe Ser Pro Thr Leu Met Leu Thr Asp Glu Glu Lys Ala Arg 
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He His Ala Glu Thr Ala Leu Leu Met Glu Lys Asn Lys Gin Glu Lys 
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Glu Leu Arg Glu Arg Gin Gin Glu Glu Gin Lys Glu Lys Leu Arg Lys 
485 490 495 
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Glu Glu Gin Glu Gin Lys Ala Lys Lys Lys Gin Glu Ala Glu Glu Asn 
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Glu He Thr Glu Lys Gin Gin Lys Ala Lys Qlu Gin Met Gin Lys Lys 
5 515 



520 525 



Glu Ser Glu Gin Ala Lys Lys Glu Asp Lys Glu Thr Ser Ala Lys Arg 
530 535 540 
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Gly Lys Glu lie Thr Gly Val Lys Arg Gin Ser Lys Ser Glu His Glu 
545 550 555 500 



15 Thr Ser Asp Ala Lys Lys Ser Val Glu Asp Arg Gly Lys Arg Cys Pro 

565 570 3 '3 

20 Thr Pro Glu He Gin Lys Lys Ser Thr Gly Asp Val Pro His Thr Ser 



Val Thr Gly Asp Ser Gly Ser Gly Lys Pro Phe Lys lie Lys Gly Gin 
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595 " * " 600 605 



Pro Glu Ser Gly He Leu Arg Thr Gly Thr Phe Arg Glu Asp Thr Asp 
610 615 620 
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Asp Thr Glu Arg Asn Lys Ala Gin Arg Glu Pro Leu Thr Arg Ala Arg 
625 630 635 



35 



Ser Glu Glu Met Gly Arg He Val Pro Gly Leu Pro Ser Gly Trp Ala 
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40 Lys Phe Leu Asp Pro He Thr Gly Thr Phe Arg Tyr Tyr His Ser Pro 
660 665 °' u 

Thr Asn Thr Val His Met Tyr Pro Pro Glu Met Ala Pro Ser Ser Ala 
45 675 680 685 



Pro Pro Ser Thr Pro Pro Thr His Lys Ala Lys Pro Gin He Pro Ala 
690 

50 



695 700 



Glu 



Arg Asp Arg Glu Pro Ser Lys Leu Lys Arg Ser Tyr Ser Ser Pro 
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710 
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720 



Asp lie Thr Gin Ala lie Gin Glu Glu Glu Lys Arg Lys Pro Thr Val 
725 730 735 



Thr Pro Thr Val Asn Arg Glu Asn Lys Pro Thr Cys Tyr Pro Lys Ala 
740 745 750 



Glu lie Ser Arg Leu Ser Ala Ser Gin lie Arg Asn Leu Asn Pro Val 
755 760 765 



Phe Gly Gly Ser Gly Pro Ala Leu Thr Gly Leu Arg Asn Leu Gly Asn 
770 ^ 775 780 



Thr Cys Tyr Met Asn Ser He Leu Gin Cys Leu Cys Asn Ala Pro His 
785 790 795 800 



Leu Ala Asp Tyr Phe Asn Arg Asn Cys Tyr Gin Asp Asp lie Asn Arg 
805 810 815 



Ser Asn Leu Leu Gly His Lys Gly Glu Val Ala Glu Glu Phe Gly He 
820 825 830 



He Met Lys Ala Leu Trp Thr Gly Gin Tyr Arg Tyr lie Ser Pro Lys 
835 840 845 



Asp Phe Lys He Thr He Gly Lys He Asn Asp Gin Phe Ala Gly Tyr 
850 855 860 



Ser Gin Gin Asp Ser Gin Glu Leu Leu Leu Phe Leu Met Asp Gly Leu 
865 870 875 880 



His Glu Asp Leu Asn Lys Ala Asp Asn Arg Lys Arg Tyr Lys Glu Glu 
885 890 895 



Asn Asn Asp His Leu Asp Asp Phe Lys Ala Ala Glu His Ala Trp Gin 
900 905 910 



Lys His Lys Gin Leu Asn Glu Ser He He Val Ala Leu Phe Gin Gly 



915 



920 925 



Gin Phe Lys Ser Thr Val Gin Cys Leu Thr Cys His Lys Lys Ser Arg 
5 930 935 940 

Thr Phe Glu Ala Phe Met Tyr Leu Ser Leu Pro Leu Ala Ser Thr Ser 
945 950 

10 

Lys Cys Thr Leu Gin Asp Cys Leu Arg Leu Phe Ser Lys Glu Glu Lys 
965 970 

15 Leu Thr Asp Asn Asn Arg Phe Tyr Cys Ser His Cys Arg Ala Arg Arg 
980 985 

20 Asp Ser Leu Lys Lys He Glu m Trp Lys Leu Pro Pro^ Val Leu Leu 

Val His Leu Lys Arg Phe Ser Tyr Asp Gly Arg Trp Lys Gin Lys 
25 1010 1015 1020 



L eu Gin Thr Ser Val Asp Phe Pro Leu Glu Asn Leu Asp Leu Ser 
1025 1030 1° 35 
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Gin Tyr Val He Gly Pro Lys Asn Asn Leu Lys Lys Tyr Asn Leu 
1040 1° 45 1050 



35 Phe Ser Val Ser Asn His Tyr Gly Gly Leu Asp Gly Gly His Tyr 
1055 1° 60 106 

40 Thr Ala Tyr Cys Lys Asn Ala Ala Arg Gin Arg TrP Q Phe Lys Phe 
1070 1075 

Asp Asp His Glu val ser .j. He Ser val Ser Ser val lys Ser 
45 1085 1090 



Ser Ala Ala Tyr He Leu Phe Tyr Thr Ser Leu Gly Pro Arg Val 
1100 llu5 
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Thr Asp Val Ala Thr 



1115 
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